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£5 (54) Title: ONCOLOGY DRUG INNOVATION 
ON 

jjjg (57) Abstract: The present invention describes methods for identification of molecules expressed at a different level on the cell 
S surface of cancer cells compared to non-malignant cells and methods of identification of cancer specific promoters to be used singly 
* — * or in combination for delivery and expression of therapeutic genes for treatment of cancer. The invention furthermore describes 
2 targeting complexes targeted to cell surface molecules identified by the methods of the invention. In embodiments of the invention 

said targeting complexes comprise the promoters identified by the methods of the invention. In addition the invention describes 
Q methods of identifying binding partners for the cell surface molecules and the binding partners per se. Methods of treatment using 

the targeting complexes and uses of the targeting complexes for the preparation of a medicament are also disclosed by the invention. 
^ Furthermore, the invention describes uses of the cell surface molecules or fragments thereof for preparation of vaccines. 
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Oncology drug innovation 
Field of invention 

5 The present invention relates to methods of identification of molecules on the cell 
surface of cancer cells and a method of identification of cancer specific promoters to 
be used singly or in combination for delivery and expression of therapeutic genes for 
treatment of cancer. 

10 Background of the invention 

Approximately half of all patients with cancer have disseminated disease at the time 
of diagnosis. Existing cancer therapies are able to cure only 5-7% of these patients. 
Consequently, there is a great need for more effective drugs, which can be admin- 
istered systemically alone or in combination with existing treatments. Methods util- 
15 ising gene therapy to deliver efficient and specific treatment of cancer cells is there- 
fore a promising strategy. However, strategies applied to this date have only had 
limited success and the development of suitable delivery systems need further de- 
velopment. 

20 Del i ve ry vectors 

The choice of the delivery vector for gene therapy is a major issue. Many vector 
systems have been tested for their suitability for gene transfer, including viral vec- 
tors such as retrovirus, adenovirus, adeno-associated virus, lentivirus and non viral 
vectors such as complexing with liposomes, cationic lipids or polycations. However, 

25 all of these vectors have specific advantages and limitations. Retrovirus requires 

mitotic division for transduction, but mediate long term expression, as they integrate 
in the genome. Adenovirus will transduce both dividing and non-dividing cells, but 
only transiently as they remain episomal. Adenovirus, however, are highly immuno- 
genic and retrovirus are rapidly inactivated by the human complement system. Len- 

30 tivirus does not induce immune response, but involve specific safety concerns, as it 
is a member of the immunodeficiency virus. More than 75% of all protocols so far 
have used viral vectors despite these are difficult and expensive to produce, there is 
a limited insert size of the therapeutic gene and there are many safety considera- 
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tions to be made. Therefore, the majority of the protocols used for adenoviral vec- 
tors have administered the therapeutic gene by local delivery (injection into the tu- 
mour) to increase the local titer of the virus and avoid immunogenic response, but 
even the highest titer system has not yet been sufficient to cure local tumours. A 

5 major disadvantage of viral vector systems is that their uptake is unspecific and not 
targeted to the cancer cells. However, as adenovirus still is the preferred vector due 
to its efficiency of delivery, ways of reducing the immune response and target the 
virus to specific cells are under development. On the other hand, liposomes and 
polycation complexes, which are less immunogenic, easier to produce and do not 

10 need the safety considerations of viral vectors have much lower transfection effi- 
ciency than viral transduction and also lack the cell specificity. However, polycations 
have the ability to compact and neutralise the charge of the delivered DNA and PEI 
complexes appear relatively stable in the blood system (Goula et.al., 1998; reviewed 
in Mountain, 2000). 

1 5 To assure high specificity and to limit undesired side effects of the treatment, it is of 
importance to design a vector or vehicle, which targets and delivers the therapeutic 
gene in question to the cancer cells efficiently and with high specificity. However, as 
described below, this involves assembly of a multi component vector. 

20 Receptor targeting. 

Functional receptors or other cell surface molecules, which can internalise by ligand 
or antibody binding on the cancer cell surfaces, can be used to target the gene de- 
livery to the cells. Receptor targeted gene delivery by means of DNA conjugated to 
a ligand of the receptor offers a promising approach. The major advantages of tar- 

25 geted gene delivery are that receptor targeting can be performed without virus, thus 
eliminating many of the obstacles present in current strategies of gene therapy. 
Successful deliverance of genes to cancer cells using receptor targeting has been 
reported to a variety of different surface receptors including receptors for epidermal 
growth factor (Cristano and. Roth, 1996, Frederiksen et al., 2000), folate (Gottschalk 

30 et al., 1 994), transferrin (Wagner et al., 1 990). High expression of a specific recep- 
tor is not always a pre-requisite for efficient receptor mediated uptake, as has been 
demonstrated for the epidermal growth factor receptor (Frederiksen et al., 2000). 
However, many of the receptors expressed by cancer cells are also expressed by 
normal cells to some extent, meaning that normal cells will often be targeted as well. 

35 This issue emphasises the need for further requirements for specificity for the ex- 
pression or nature of the therapeutic gene. 
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Molecular conjugates 

For targeted gene therapy it is essential that the ligand to be internalised and DNA 
expressing the therapeutic gene are physically associated for receptor mediated 
5 uptake. Several methods have been used for preparing non-viral, synthetic vectors 
of targeted DNA molecular conjugates by associating cationic polymers, such as 
poly-L-lysine (Frederiksen et al., 2000) or polyethylenimine (PEI) (Kircheis et al., 
1997) (polyplexes) with the ligand and DNA. Successful gene targeting has been 
reported for a number of molecular conjugates. The ligand has either been cova- 

10 lently linked to the polycation, or biotinylated ligand and polylysine were complexed 
via streptavidine to form condensed conjugates with DNA, which are internalised by 
the receptor of the ligand. One of the advantages of these system over virus medi- 
ated transfer is the lack of size limitation of the DNA. PEI complexes, in addition, 
appear to be able to pass the capillary barrier in lung, making this compound one 

1 5 agent for molecular conjugates. 

Endosomal release of molecular conjugate. 

After endocytosis of the DNA/Iigand conjugate by the receptor, the normal pathway 
would lead to degradation and loss of DNA. It has therefore proven essential to in- 

20 elude an endosomolytic agent in the molecular conjugate. Adenovirus, replication 
deficient adenovirus and the viral capside have all proven to be very efficient for 
endosomal lysis, when included in the molecular conjugate. However, all the reser- 
vations of unspecific uptake, safety and immunogenic response applying to use of 
using adenovirus as vectors also apply for this system. Inclusion of other fusogenic 

25 peptides containing amino acid sequences from e.g. influenza virus, toxins or syn- 
thetic peptides in the molecular conjugate have been tested for cytoplasmic release. 
These have the advantage of less immunogenicity and lower cost, but have been 
shown to be less effective in endosomal lysis than adenovirus. However, if the mo- 
lecular conjugate is formed using the polycationic PEI, inclusion of endosomolytic 

30 agents are not necessary, as PEI has an intrinsic endosome-buffering capacity re- 
sulting in endosomal swelling and rupture. 

Cancer specific promoters 

An increase in the specificity of the targeting of a therapeutic gene to cancer cells 
35 can be obtained if a tumour specific promoter controlling the expression can be used 
(reviewed in Nettelbeck et al., 2000). Promoters for genes, whose expression is 
specific for the malignant phenotype, but show no tissue specificity such as telomer- 



WO 03/000928 



PCT/IB02/03534 



4 

ase have been used. Also, promoters regulating oncofetal antigens, which are not 
normally expressed in the adult, have been found to be active in tumor cells, such 
as carcinoembryonic antigen (CEA). However, the activity of these promoters (com- 
pared to strong, constitutive active viral promoters) have often proven not to mediate 
5 sufficient expression of the therapeutic gene, wherefore the tumour specific genes 
have been used for activation of another, stronger promoter controlling the thera- 
peutic gene. Another disadvantage of oncofetal promoters is that these promoters 
will only be active in a subset of tumour types, depending on the tissue origin of the 
tumour. Alternatively, synthetic promoters have been designed taking advantage of 
10 the fact that many oncogenes which are overexpressed in cancer cells are tran- 
scription factors, which can mediate high transcriptional activity from their respective 
DNA recognition sequences. 

Therapeutic genes 

15 The product of a therapeutic gene must be able to effectively induce cell death. 

Gene therapy strategies for cancer treatment have used many different approaches. 
These include immunogene therapy such as cytokine stimulation of immune system 
(enhancing the immune response against tumour cells), selective prodrug activation, 
suicide genes, restoration of tumor suppressor genes and inhibition of activated on- 

20 cogenes (reviewed in Frederiksen et aL, 1999; Gunji et al M 2000). Indeed, most of 
the present therapeutic protocols in clinical trials against cancer involve immuno- 
therapy. However, as the molecular phenotype of many types of cancer regarding 
aberrant expression or mutations of oncogenes and tumour suppressor genes, 
these are obvious candidates to target. Therapeutic gene products reducing expres- 

25 sion or activity of oncogenes, such as antisense RNA or neutralising antibody frag- 
ments, have been tried and shown to inhibit proliferation. However, oncogene inacti- 
vation does not necessarily kill the cells and is therefore probably not applicable for 
short term treatment. One of the at present promising strategies is to reintroduce 
tumour suppressor genes, as most cancer cells exhibit loss of function of one or 

30 more of these genes. Of particular interest is the tumour suppressor gene TP53 en- 
coding p53, which is a transcription factor, which activates genes known to be in- 
volved in cell cycle arrest and induction of apoptosis. Reintroduction of wild type p53 
has been shown to markedly reduce tumour cell growth or induce apoptosis of can- 
cer cells in both in vitro and in vivo systems (Roth et al„ 1996; Nielsen and Maneval, 

35 1998). 

However, gene products rendering cells sensitive to otherwise harmless drugs has 
also been extensively used for gene therapy trials. In particular, the herpes simplex 
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virus thymidine kinase (HSV-tk) in combination with the nucleoside analogue drug 
gangcyclovir has been used. However, the conversion of the drug to a toxic nucleo- 
side analogue by the enzyme only will kill cells, which are dividing. However, the 
toxic products are transmitted to surrounding cells by the so-called "by-stander" ef- 
5 feet, making the approach potential for systems with low targeting efficiency. 



Summary of the invention 

1 0 Accordingly, it is a first objective of the present invention to provide methods for 
identifying a plurality of cell surface molecules, which are expressed at a different 
level in malignant cells compared with normal cells, comprising the steps of: 
i) Providing at least 3 malignant cell lines selected from the group 

consisting of CPH 54 A, CPH 54 B, GLC 2, GLC 3, GLC 14, GLC 16, 

15 GLC 19, GLC 26, GLC 28, DMS 53, DMS 79, DMS 92, DMS 114, 

DMS 153, DMS 273, DMS 406, DMS 456, NCI H69, NCI N417, MAR 
H24, MAR 86 Ml, SHP-77, NCI-H2171, NCI-H2195, NCI-H2196, NCI- 
H2198, NCI-H2227, NCI-H2286, NCI-H2330, NCI-H735, NCI-H1339, 
NCI-H1963, NCI-H2107, NCI-H2108. NCI-H1304, NCI-H1341, NCI- 

20 H1417, NCI-H1436, NCI-H1522, NCI-H1618, NCI-H1672, NCI- 

H1694, NCI-H1836, NCI-H1870, NCI-H1876, NCI-H1882, NCI- 
HI 926, NCI-H1930, NCI-H1994, NCI-H2029. NCI-H2059, NCI- 
H2066, NCI-H2081, NCI-H2141, NCI-H211, NCI-H220, NCI-H250, 
NCI-H524, NCI-H592, NCI-H711, NCI-H719, NCI-H740, NCI-H748, 

25 NCI-H774, NCI-H841, NCI-H847. NCI-H865, NCI-H1048, NCI-H1059, 

NCI-H1092, NCI-H1105, NCI-H1184, NCI-H1238, NCI-H1284, NCI- 
HI 688, NCI-H187, NCI-H378, NCI-H526, NCI-H660, NCI-H889, NCI- 
H60, NCI-H196, NCI-H446, NCI-H209, NCI-H146, NCI-H82, NCI- 
H460, NCI-H345, NCI-H510A, NCI-128, NCI-446, SW 1271 

30 ii) Providing at least 3 total RNA samples derived from normal tissue 

selected from the group consisting of liver, heart, kidney, lung, 
adrenal gland, colon, pancreas, small intestine, spleen, skeletal 
muscle, trachea, prostate, placenta, salivary gland, testes, 
leucocytes, leucocytes, brain, adipose tissue, bladder, breast, cervix, 
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esophagus, larynx, ovary, rectum, skin, spinal cord, stomach, thymus, 
thyroid and uterus. 

. iii) Comparing the expression of mRNA in the cell lines according to step 
i) and tissue samples according to step ii) 
5 iv) Identifying nucleic acid sequences, wherein 

a) there is a difference between the amount of 
mRNA expressed in one or more cell lines 
according to i) and the amount of mRNA 
expressed in one or more tissues according to ii); 

10 and/or 

b) there is essentially no difference in the amount of 
mRNA expressed in at least two cell lines 
according to i); and/or 

c) there is essentially no difference in the amount of 
1 5 mRNA expressed in at least two tissue samples 

according to ii); and 
v) Selecting among the nucleic acid sequences according to iv), nucleic 
acid sequences encoding for potential cell surface molecules. 

20 It is a second objective of the present invention to provide methods of identifying first 
nucleic acid sequences, which are capable of directing expression of second nucleic 
acid sequences operably linked thereto, wherein the level of said expression is 
different in malignant cells compared with normal cells comprising the steps of: 

25 i) Providing at least 3 malignant cell lines selected from the group 

consisting of CPH 54 A, CPH 54 B, GLC 2, GLC 3, GLC 14, GLC 16, 
GLC 19, GLC 26, GLC 28, DMS 53, DMS 79, DMS 92, DMS 114, 
DMS 153, DMS 273, DMS 406, DMS 456, NCI H69, NCI N417, MAR 
H24, MAR 86 Ml, SHP-77, NCI-H2171, NCI-H2195, NCI-H2196, NCI- 

30 H2198, NCI-H2227, NCI-H2286, NCI-H2330, NCI-H735, NCI-H1339, 

NCL-H1963, NCI-H2107, NCI-H2108, NCI-H1304, NCI-H1341, NCI- 
H1417, NCI-H1436, NCI-H1522, NCI-H1618, NCI-H1672, NCI- 
H1694, NCI-H1836, NCI-H1870, NCI-H1876, NCI-H1882, NCI- 
H1926, NCI-H1930, NCWH1994, NCI-H2029, NCI-H2059, NCI- 

35 H2066, NCI-H2081 , NCI-H2141 , NCI-H21 1 , NCI-H220, NCI-H250, 
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NCI-H524, NCI-H592, NCI-H711, NCI-H719, NCI-H740, NCI-H748, 
NCI-H774, NCI-H841, NCI-H847, NCI-H865, NCI-H1048, NCI-H1059, 
NCI-H1092, NCI-H1105, NCI-H1184, NCI-H1238, NCI-H1284, NCI- 
HI 688, NCI-H187, NCI-H378, NCI-H526, NCI-H660, NCI-H889, NCI- 
5 H60, NCI-H196, NCI-H446, NCI-H209, NCI-H146, NCI-H82, NCI- 

H460, NCI-H345, NCI-H510A, NCI-128, NCI-446, SW 1271 

ii) Providing at least ,3 RNA samples derived from normal tissue 
samples derived from the group consisting of liver, heart, kidney, 
lung, adrenal gland, colon, pancreas, small intestine, spleen, skeletal 

10 muscle, trachea, prostate, placenta, salivary gland, testes, 

leucocytes, brain, adipose tissue, bladder, breast, cervix, esophagus, 
larynx, ovary, rectum, skin, spinal cord, stomach, thymus, thyroid and 
uterus. 

iii) Comparing the expression of mRNA in the cell lines according to i) 
1 5 and tissue samples according to ii) 

iv) Identifying second nucleic acid sequences, wherein 

a) there is a difference between the amount of 
mRNA expressed in one or more cell lines 
according to i) and the amount of mRNA 

20 expressed in one or more tissues according to ii); 

and/or 

b) ( there is essentially no difference in the amount of 
mRNA expressed in at least two cell lines 
according to i); and/or 

25 c) there is essentially no difference in the amount of 

mRNA expressed in at least two tissue samples 
according to ii) 

v) Identifying first nucleic acid sequences operably linked to the second 
nucleotide sequences identified in step iv) 

30 

It is a third objective of the present invention to provide uses of a pharmaceutical^ 
effective amount of the cell surface molecules identified according to the present 
invention for the preparation of a vaccine. Furthermore, the present invention pro- 
vides uses of a pharmaceutical^ effective amount of a nucleic acid sequence en- 
35 coding a cell surface molecule identified according to the methods of the present 
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invention for the preparation of a vaccine. The present invention also provides uses 
of a pharmaceutically effective amount of a cell surface molecule and/or a nucleic 
acid sequence encoding such a cell surface molecule for the preparation of a vac- 
cine, wherein said cell surface molecule preferably comprises or essentially consists 
5 of or for example is GRIA2, such as LPR8, for example is CHRNA5, such as 

TMEFF, for example is NPTXR, such asTransferrin receptor; such as type II mem- 
brane protein clone: for example is HP10481 ; such as type II membrane protein 
clone: such as HP10390; for example is PG40; such as TRC8 ; for example is TR2- 
1 1 ; such as OA3 antigenic surface determinant; for example is integrin alpha 6, For 

10 example GPIIb; such as vitronectin receptor alpha subunit; for example is integrin 
alpha-7; such as integrin alpha E precursor; for example is integrin alpha 6B; such 
as integrin alpha 5 subunit; for example is integrin beta-5 subunit; such as integrin 
alpha-3 chain; for example is RYK; such as amyloid precursor protein-binding pro- 
tein 1; for example is putative transmembrane GTPase; such as membrane cofactor 

15 protein; FOR EXAMPLE GLVR1; for example is Mr 1 10,000 antigen; for example is 
syndecan-1; such as putative seven transmembrane domain protein; for example is 
LCA-homolog/ LAR protein; such as M6 antigen; for example is Me491/CD63 anti- 
gen; such as multispanning membrane protein; for example is DDR; such as auto- 
crine motility factor receptor, for example is insulin receptor precursor; such as 

20 IGF1R, for example is insulin-like growth factor II receptor; such as SAS; for exam- 
ple is TAPA-1 ; such as MICB; for example is MHC class II HLA-DR7-associated 
glycoprotein beta-chain; such as HLA-DP; for example is bone small proteoglycan I 
biglycan; such as CAR; for example is MEA11; such as interferon-gamma receptor 
alpha chain; for example is Polymeric immunoglobulin receptor; such as metabo- 

25 tropic glutamate receptor type 4; for example is metabotropic glutamate receptor 8; 
such as CLPTM1; for example is MAGE-4b; such as MAGESa; for example is 
MAGE-3; such as MAGE-1 ; for example is MAGE6; such as MAGE-9; for example 
is MAGE11; such as CD24; for example is CD59; such as CD44; for example is low 
density lipoprotein receptor; such as very low density lipoprotein receptor; for exam- 

30 pie is N-CAM; such as lamin B receptor homolog TM7SF2; for example is putative 
T1/ST2 receptor binding protein precursor; such as NTR2 receptor; for example is 
RAGE-4; such as HLA-G1; for example is MOAT-C; such as alpha 2 delta calcium 
channel subunit isoform I; for example is LFA-3; such as L1-CAM; for example is 
AVPR2; such as C1 p1 1 5 C1 ; for example is TE2; such as RbP; for example is 

35 HCF1; such as IRAK; for example is CD151; such as surface antigen; for example is 



WO 03/000928 



PCT/IB02/03534 



9 

MAG; such as GPR19; for example is pcta-1; such as PRAME; for example is vaso- 
pressin activated calcium mobilizing receptor-like protein; such as serotonin receptor 
5-HT4B; for example is serotonin. 1D receptor (5-HT1D-); such as CD9; for example 
is LDL receptor member LR3; such as DR6; for example is tumor necrosis factor 
5 receptor; such as HG38; for example is urokinase-type plasminogen receptor; such 
as FGF receptor; for example is nerve growth factor receptor; such as cys- 
tine/glutamate transporter; for example is CB1 cannabinoid receptor (CNR1); such 
as PSG; for example is PSG13'; such as CPE-receptor; for example is CRH2R; 
such as OCI5; for example is TRAIL receptor 2; such as HNMP-1 ; for example is 

10 kidney alpha-2-adrenergic receptor; such as erythropoietin receptor; for example is 
chondroitin sulphate proteoglycan versican V1; for example is mGluRlbeta; such as 
CD97; for example is L6; such as NY-ESO-1; for example is T-cell receptor alpha 
delta; such as rorl ; for example is ror2; such as SSTR2; for example is VESPR; 
such as IgG Fc receptor; for example is glutamate receptor subunit GluRC; such as 

15 HEK2; for example, is PVR; such as CEA; for example is CC-chemokine-binding 

receptor JAB61; such as HER2; for example is HER3; such as hypothetical protein 
FLJ22357 similar to Epidermal growth factor receptor-related protein; for example is 
putative endothelin receptor type B-like protein; such as GLVR2; for example is 
P2X4 purinoreceptor; such as FPRL1; for example is Atrial natriuretic peptide clear- 

20 ance receptor; for example is gastrin/CCK-B receptor; such as Neuromedin* B re- 
ceptor; for example is GFRA3; such as GRPR; for example is CDH1; such as 
CDH2; for example is TGFBR1; such as TGFBR2; for example is TGFBR3; such as 
precursor of epidermal growth factor receptor. 

25 It is a fourth objective of the present invention to provide uses of a cell surface 

molecule identified according to the methods described by the present invention as 
a drug target, wherein said drug target is capable of binding a binding partner and 
internalising said binding partner into cells expressing said cell surface molecule. 
Furthermore, the present invention provides uses of a cell surface molecule which 

30 preferably comprises or essentially consists of or for example is GRIA2, such as 
LPR8, for example is CHRNA5, such as TMEFF, for example is NPTXR, such as 
Transferrin receptor; such as type II membrane protein clone: for example is 
HP10481; such as type II membrane protein Clone: such as HP1O390; for example 
is PG40; such as TRC8 ; for example is TR2-1 1; such as OA3 antigenic surface 

35 determinant; for example is integrin alpha 6, For example GPIIb; such as vitronectin 
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receptor alpha subunit; for example is integrin alpha-7; such as integrin alpha E 
precursor; for example is integrin alpha 6B; such as integrin alpha 5 subunit; for 
example is integrin beta-5 subunit; such as integrin alpha-3 chain; for example is 
RYK; such as amyloid precursor protein-binding protein 1; for example is putative 
5 transmembrane GTPase; such as membrane cofactor protein; FOR EXAMPLE 
GLVR1 ; for example is Mr 1 10,000 antigen; for example is syndecan-1 ; such as 
putative seven transmembrane domain protein; for example is LCA-homolog/ LAR 
protein; such as M6 antigen; for example is Me491/CD63 antigen; such as 
multispanning membrane protein; for example is DDR; such as autocrine motility 

10 factor receptor; for example is insuljn receptor precursor; such as IGF1R, for 

example is insulin-like growth factor II receptor; such as SAS; for example is TAPA- 
1; such as MICB; for example is MHC class II HLA-DR7-associated glycoprotein 
beta-chain; such as HLA-DP; for example is bone small proteoglycan I biglycan; 
such as CAR; for example is MEA11; such as interferon-gamma receptor alpha 

15 chain; for example is Polymeric immunoglobulin receptor; such as metabotropic 

glutamate receptor type 4; for example is metabotropic glutamate receptor 8; such 
as CLPTM1; for example is MAGE-4b; such as MAGESa; for example is MAGE-3; 
such as MAGE-1; for example is MAGE6; such as MAGE-9; for example is 
MAGE11; such as CD24; for example is CD59; such as CD44; for example is low 

20 density lipoprotein receptor; such as very low density lipoprotein receptor; for 
example is N-CAM; such as lamin B receptor homolog TM7SF2; for example is 
putative T1/ST2 receptor binding protein precursor; such as NTR2 receptor; for 
example is RAGE-4; such as HLA-G1; for example is MOAT-C; such as alpha 2 
delta calcium channel subunit isoform I; for example is LFA-3; such as L1-CAM; for 

25 example is AVPR2; such as C1 p1 1 5 C1 ; for example is TE2; such as RbP; for 

example is HCF1 ; such as IRAK; for example is CD151 ; such as surface antigen; for 
example is MAG; such as GPR19; for example is pcta-1; such as PRAME; for 
example is vasopressin activated calcium mobilizing receptor-like protein; such as 
serotonin receptor 5-HT4B; for example is serotonin 1D receptor (5-HT1D-); such 

30 as CD9; for example is LDL receptor member LR3; such as DR6; for example is 
tumor necrosis factor receptor; such as HG38; for example is urokinase-type 
plasminogen receptor; such gs FGF receptor; for example is nerve growth factor 
receptor; such as cystine/glutamate transporter; for example is CB1 cannabinoid 
receptor (CNR1); such as PSG; for example is PSG13'; such as CPE-receptor; for 

35 example is CRH2R; such as OCI5; for example is TRAIL receptor 2; such as HNMP- 
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1; for example is kidney alpha-2-adrenergic receptor; such as erythropoietin 
receptor; for example is chondroitin sulphate proteoglycan versican V1; for example 
is mGluRI beta; such as CD97; for example is L6; such as NY-ESO-1 ; for example 
is T-cell receptor alpha delta; such as rorl; for example is ror2; such as SSTR2; for 
5 example is VESPR; such as IgG Fc receptor; for example is glutamate receptor 
subunit GluRC; such as HEK2; for example is PVR; such as CEA; for example is 
CC-chemokine-binding receptor JAB61; such as HER2; for example is HER3; such 
as hypothetical protein FLJ22357 similar to Epidermal growth factor receptor-related 
protein; for example is putative endothelin receptor type B-like protein; such as 

10 GLVR2; for example is P2X4 purinorecepton such as FPRL1 ; for example is Atrial 
natriuretic peptide clearance receptor; for example is gastrin/CCK-B receptor; such 
as Neuromedin B receptor; for example is GFRA3; such as GRPR; for example is 
CDH1; such as CDH2; for example is TGFBR1; such as TGFBR2; for example is 
TGFBR3; such as precursor of epidermal growth factor receptor as drug target, 

15 wherein said drug target is capable of binding a binding partner and internalising 
said binding partner into cells expressing said cell surface molecule. 

i 

It is a fifth objective of the present invention to provide methods of identifying and/or 
preparing specific binding partners comprising the steps of 

20 

i) Providing a cell surface molecule identified by the methods described 
by the present invention 

ii) Identifying and/or preparing binding partners capable of associating 
with said cell surface molecules 

25 

It is furthermore an objective of the present invention to provide methods of 
identifying and/or preparing specific binding partners comprising the steps of 

i) Providing a cell surface molecule which preferably comprises or 
30 essentially consists of or for example is Transferrin receptor; such as 

type II membrane protein clone: for example is HP10481; such as 
type II membrane protein clone: such as HP10390; for example is 
PG40; such as TRC8 ; for example is TR2-1 1 ; such as OA3 antigenic 
surface determinant; for example is integrin alpha 6, For example 
35 GPIIb; such as vitronectin receptor alpha subunit; for example is 
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integrin alpha-7; such as integrin alpha E precursor; for example is 
integrin alpha 6B; such as integrin alpha 5 subunit; for example is 
integrin beta-5 subunit; such as integrin alpha-3 chain; for example is 
RYK; such as amyloid precursor protein-binding protein 1; for 
example is putative transmembrane GTPase; such as membrane 
cofactor protein; FOR EXAMPLE GLVR1; for example is Mr 110,000 
antigen; for example is syndecan-1; such as putative seven 
transmembrane domain protein; for example is LCA-homolog/ LAR 
protein; such as M6 antigen; for example is Me491/CD63 antigen; 
such as multispanning membrane protein; for example is DDR; such 
as autocrine motility factor receptor; for example is insulin receptor 
precursor; such as IGF1R, for example is insulin-like growth factor II 
receptor; such as SAS; for example is TAPA-1; such as MICB; for 
example is MHC class II HLA-DR7-associated glycoprotein beta- 
chain; such as HLA-DP; for example is bone small proteoglycan I 
biglycan; such as CAR; for example is MEA1 1; such as interferon- 
gamma receptor alpha chain; for example is Polymeric 
immunoglobulin receptor; such as metabotropic glutamate receptor 
type 4; for example is metabotropic glutamate receptor 8; such as 
CLPTM1; for example is MAGE-4b; such as MAGESa; for example is 
MAGE-3; such as MAGE-1; for example is MAGE6; such as MAGE-9; 
for example is MAGE1 1 ; such as CD24; for example is CD59; such 
as CD44; for example is low density lipoprotein receptor; such as very 
low density lipoprotein receptor; for example is N-CAM; such as lamin 
B receptor homolog TM7SF2; for example is putative T1/ST2 receptor 
binding protein precursor; such as NTR2 receptor; for example is 
RAGE-4; such as HLA-G1; for example is MOAT-C; such as alpha 2 
delta calcium channel subunit isoform I; for example is LFA-3; such 
as L1-CAM; for example is AVPR2; such as C1 p1 15 C1; for example 
is TE2; such as RbP; for example is HCF1; such as IRAK; for 
example is CD151; such as surface antigen; for example is MAG; 
such as GPR19; for example is pcta-1; such as PRAME; for example 
is vasopressin activated calcium mobilizing receptor-like protein; such 
as serotonin receptor 5-HT4B; for example is serotonin 1 D receptor 
(5-HT1D-); such as CD9; for example is LDL receptor member LR3; 
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such as DR6; for example is tumor necrosis factor receptor; such as 
HG38; for example is urokinase-type plasminogen receptor; such as 
FGF receptor; for example is nerve growth factor receptor; such as 
cystine/glutamate transporter; for example is CB1 cannabinoid 
5 receptor (CNR1 ); such as PSG; for example is PSG13 1 ; such as CPE- 

receptor; for example is CRH2R; such as OCI5; for example is TRAIL 
receptor 2; such as HNMP-1 ; for example is kidney alpha-2- 
adrenergic receptor; such as erythropoietin receptor; for example is 
chondroitin sulphate proteoglycan versican V1; for example is 

10 mGluRlbeta; such as CD97; for example is L6; such as NY-ESO-1 ; 

for example is T-cell receptor alpha delta; such as rorl; for example is 
ror2; such as SSTR2; for example is VESPR; such as IgG Fc 
. receptor; for example is glutamate receptor subunit GluRC; such as 
HEK2; for example is PVR; such as CEA; for example is CC- 

1 5 chemokine-binding receptor JAB61 ; such as HER2; for example is 

HER3; such as hypothetical protein FLJ22357 similar to Epidermal 
growth factor receptor-related protein; for example is putative 
endothelin receptor type B-like protein; such as GLVR2; for example 
is P2X4 purinoreceptor; such as FPRL1; for example is Atrial 

20 natriuretic peptide clearance receptor; for example is gastrin/CCK-B 

receptor; such as Neuromedin B receptor; for example is GFRA3; 
such as GRPR; for example is CDH1 ; such as CDH2; for example is 
TGFBR1; such as TGFBR2; for example is TGFBR3; such as 
precursor of epidermal growth factor receptor. 

25 ii) Identifying and/or preparing binding partners capable of associating 

with said cell surface molecules. 

A further objective of the present invention is to provide isolated and/or purified 
specific binding partners capable of associating with, cell surface molecules, which 

30 are expressed at a different level in malignant cells compared with normal cells, 

identified by the methods provided by the present invention. The present invention 
also provides isolated and/or purified specific binding partners capable of 
associating with a cell surface molecule which preferably comprises or essentially 
consists of or for example is GRIA2, such as LPR8, for example is CHRNA5, such 

35 as TMEFF, for example is NPTXR, such as Transferrin receptor; such as type II 
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membrane protein clone: for example is HP10481; such as type II membrane 
protein clone: such as HP10390; for example is PG40; such as TRC8 ; for example 
is TR2-1 1; such as OA3 antigenic surface determinant; for example is integrin alpha 
6, For example GPIIb; such as vitronectin receptor alpha subunit; for example is 
5 integrin alpha-7; such as integrin alpha E precursor; for example is integrin alpha 
6B; such as integrin alpha 5 subunit; for example is integrin beta-5 subunit; such as 
integrin alpha-3 chain; for example is RYK; such as amyloid precursor protein- 
binding protein 1 ; for example is putative transmembrane GTPase; such as 
membrane cofactor protein; FOR EXAMPLE GLVR1; for example is Mr 110,000 

10 antigen; for example is syndecan-1; such as putative seven transmembrane domain 
protein; for example is LCA-homolog/ LAR protein; such as M6 antigen; for example 
is Me491/CD63 antigen; such as multispanning membrane protein; for example is 
DDR; such as autocrine motility factor receptor; for example is insulin receptor 
precursor; such as IGF1R, for example is insulin-like growth factor II receptor; such 

15 as SAS; for example is TAPA-1; such as MICB; for example is MHC class II HLA- 
DR7-associated glycoprotein beta-chain; such as HLA-DP; for example is bone 
small proteoglycan I biglycan; such as CAR; for example is MEA11; such as 
interferon-gamma receptor alpha chain; for example is Polymeric immunoglobulin 
receptor; such as metabotropic glutamate receptor type 4; for example is 

20 metabotropic glutamate receptor 8; such as CLPTM1 ; for example is MAGE-4b; 
such as MAGESa; for example is MAGE-3; such as MAGE-1; for example is 
MAGE6; such as MAGE-9; for example is MAGE1 1 ; such as CD24; for example is 
CD59; such as CD44; for example is low density lipoprotein receptor; such as very 
low density lipoprotein receptor; for example is N-CAM; such as lamin B receptor 

25 homolog TM7SF2; for example is putative T1/ST2 receptor binding protein 

precursor; such as NTR2 receptor; for example is RAGE-4; such as HLA-G1; for 
example is MOAT-C; such as alpha 2 delta calcium channel subunit isoform I; for 
example is LFA-3; such as L1-CAM; for example is AVPR2; such as C1 p1 15 C1 ; 
for example is TE2; such as RbP; for example is HCF1 ; such as IRAK; for example 

30 is CD151; such as surface antigen; for example is MAG; such as GPR19; for 

example is. pcta-1 ; such as PRAME; for example is vasopressin activated calcium 
mobilizing receptor-like protein; such as serotonin receptor 5-HT4B; for example is 
serotonin ID receptor (5-HT1D-); such as CD9; for example is LDL receptor 
member LR3; such as DR6; for example is tumor necrosis factor receptor; such as 

35 HG38; for example is urokinase-type plasminogen receptor; such as FGF receptor; 
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for example is nerve growth factor receptor; such as cystine/glutamate transporter; 
for example is CB1 cannabinoid receptor (CNR1); such as PSG; for example is 
PSG1 3 1 ; such as CPE-receptor; for example is CRH2R; such as OCI5; for example 
is TRAIL receptor 2; such as HNMP-1 ; for example is kidney alpha-2-adrenergic 

.5 receptor; such as erythropoietin receptor; for example is chondroitin sulphate 

proteoglycan versican V1; for example is mGluRlbeta; such as CD97; for example 
is L6; such as NY-ESO-1; for example is T-cell receptor alpha delta; such as rorl; 
for example is ror2; such as SSTR2; for example is VESPR; such as IgG Fc 
receptor; for example is glutamate receptor subunit GluRC; such as HEK2; for 

10 example is PVR; such as CEA; for example is CC-chemokine-binding receptor 

JAB61; such as HER2; for example is HER3; such as hypothetical protein FLJ22357 
similar to Epidermal growth factor receptor-related protein; for example is putative 
endothelin receptor type B-like protein; such as GLVR2; for example is P2X4 
purinoreceptor; such as FPRL1; for example is Atrial natriuretic peptide clearance 

15 receptor; for example is gastrin/CCK-B receptor; such as Neuromedin B receptor; 
for example is GFRA3; such as GRPR; for example is CDH1 ; such as CDH2; for 
example is TGFBR1; such as TGFBR2; for example is TGFBR3; such as precursor 
of epidermal growth factor receptor. 

20 It is also an objective of the present invention to provide methods of identifying novel 
drug targets, comprising the steps of 

i) Providing a binding partner as described in the present invention 

ii) Identifying potential drug targets capable of associating with said 
25 binding partner 

It is yet another objective of the present invention to provide drug targets identified 
by the methods described by the present invention. 

30 Furthermore, it is an objective of the present invention to provide targeting 
complexes comprising: 

i) A binding partner as described by the present invention; and 

ii) A bioreactive species 
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wherein the targeting complex is capable of binding a cell surface molecule 
identified according to the methods described by the present invention and 
capable of being internalised into cells bearing said cell surface molecule. 

5 The present invention also provides uses of binding partners as describes by the 
invention for the preparation of targeting complexes according to the invention. 

It is yet a further objective of the present invention to provide pharmaceutical 
compositions comprising of the targeting complexes described by the present 
1 0 invention together with a pharmaceutically acceptable carrier. 

It is even a further objective of the present invention to provide methods of treatment 
of a premalignant and/or malignant conditions in an individual in need thereof, com- 
prising administering to said individual a pharmaceutically effective amount of the 
15 targeting complexes described by the present invention. 

Furthermore, it is an objective of the present invention to provide uses of the 
targeting complex described by the present invention for the preparation of a 
medicament for the treatment of a premalignant and/or malignant conditions in an 
20 individual in need thereof. 

Legend to figures 

Fig. 1 illustrates the principle of targeted gene therapy. 

25 

Fig. 2 illustrates a comparison between gene expression measured by Chips analy- 
sis and RT-PCR. The figure shows a quality test of cDNA used for RT-PCR valida- 
tion of Chips analysis by RT- PCR of Glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH). . 

30 

Fig. 3 illustrates a comparison between gene expression measured by Chips analy- 
sis and RT-PCR for Pro 221(IA-1). 
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Fig. 4. illustrates a comparison between gene expression measured by Chips analy- 
sis and RT- PCR of Pro 30 (KIA0042). 

Fig. 5. illustrates a comparison between gene expression measured by Chips analy- 
5 sis and RT- PCR of Pro 41 (MAD2). 

Fig. 6. illustrates a comparison between gene expression measured by Chips analy- 
sis and RT- PCR of Pro 21 0 (lamin B1). 

10 Fig. 7 illustrates a comparison between gene expression measured by Chips analy- 
sis and RT- PCR of Pro 71 (CDKN2A). 

Fig. 8 illustrates a comparison between gene expression measured by Chips analy- 
sis and RT-PCR of cell surface molecule DR6. 

Fig. 9. illustrates a comparison between gene expression measured by Chips analy- 
sis and RT-PCR of cell surface molecule LRP8. 

Fig. 10. illustrates a comparison between gene expression measured by Chips 
20 analysis and RT-PCR of cell surface molecule NTPXR. 

Fig. 1 1 . illustrates a comparison between gene expression measured by Chips 
analysis and RT-PCR of cell surface molecule NCAM1. 

25 Fig. 12A illustrates a comparison between gene expression measured by Chips 
analysis and RT-PCR of cell surface molecule GluR2 (GRIA2). 

Fig. 12B illustrates a comparison between gene expression measured by Chips 
analysis and RT-PCR of cell surface molecule ITGAV. 

30 

Fig. 13 illustrates a comparison between gene expression measured by Chips 
analysis and western blotting of mGluR8. 

Fig. 14. illustrates a comparison between gene expression measured by Chips 
35 analysis and western blot analysis for NPTXR. 
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Fig. 15. illustrates a comparison between gene expression measured by Chips 
analysis and western blot analysis for NCAM1 . 

5 Fig. 16. illustrates a comparison between gene expression measured by Chips 
analysis and western blot analysis for GluR2 (GRIA2). 

Fig. 17. illustrates a comparison between gene expression measured by Chips 
analysis and western blot analysis for ITGAE. 

10 

Detailed description of the invention 
Definitions 

15 Binding partner: See "cell surface molecule binding partner". 

Bioreactive species: Any molecule, which can directly or indirectly exert a biological 
influence on a target cell. 

20 Be: Base pair 

Cell surface molecules : Molecules naturally associated with the cell surface. 

Cell surface molecule binding partner: Any molecule that can associate specifically 
25 with a cell surface molecule. Throughout the text the terms "Cell surface molecule 

binding partner" and the shorter term "binding partner" are used interchangeably and 
both terms are equivalent to one another throughout the text. 

Enhancer: Nucleic acid sequence, which can enhance the transcription of a second 

30 nucleic acid sequence operably linked thereto. 

i • i 

First nucleic acid seouences: Nucleic acid sequences, which are capable of 
directing expression of second nucleic acid sequences operably linked thereto. 



WO 03/000928 



PCT/IB02/03534 



19 

Normal cells: Non-malignant cells that are of non-malignant origin. 
Normal tissue : Non-malignant tissue 

5 

Promoter: First nucleic acid sequences, which are capable of directing expression of 
second nucleic acid sequences operably linked thereto. 

Second nucleic acid sequences: Nucleic acid sequences, which are capable of 
10 being expressed, such as mRNA may be transcribed from such nucleic acid 
sequences, when they are operably linked to first nucleic acid sequences. 

Silencer: A nucleic acid sequence, which is capable of repressing the transcription 
of a second nucleic acid sequence operably linked thereto. 

15 

Targeting complex: Complex which comprises at least one binding partner and a 
bioreactive species and which is capable of be internalised into cells. 

Embodiments of the invention 

20 It is becoming increasingly obvious, that if gene therapy of cancer is to become an 
effective, alternative or adjuvant treatment of cancer, in particular of disseminated 
disease, several requirements must be resolved. These include for example i) tar- 
geting of the complex to be efficient and cancer specific; ii) expression of the thera- 
peutic gene to be efficient and cancer specific and iii) that the molecular conjugate is 

25 non-immunogenic, has a high stability after systemic administration and is able to 
cross capillary barriers. 

In one preferred embodiment, the present invention relates to the use of novel, high 
throughput screening methods for identification of genes specifically expressed by 
30 cancer cells and their application for double-targeted gene transfer and expression 
of therapeutic genes for treatment of cancer. An example ofthe principle of double- 
targeted gene transfer is outlined in figure 1. The screening methods according to 
the present invention enable the identification of novel molecules expressed by the 
cancer cells. 

35 



s i 
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In one embodiment the method will be applied on identification of gene expression 
of suitable molecules expressed by small cell lung cancer (SCLC) cells. Small cell 
lung cancer is a highly aggressive neoplasm, comprising of approximately 25% of all 
lung cancer cases. The disease is almost always disseminated at the time of diag- 

5 nosis. SCLC is treated with different chemotheraputic drugs alone or in combination 
with radiation therapy. Despite intensive attempts to improve treatment, and re- 
gardless of the fact that most patients respond well to the treatment in the begin- 
ning, the mortality rate is high. Existing cancer treatments are able to cure only 5-7% 
of these patients and the 5-year survival rate is extremely poor (5-15%). SCLC pa- 

1 0 tients therefore are in great need of the development of new therapies. 

The molecular phenotype of the disease has been thoroughly characterised and the 
aberrant expression of oncogenes (particularly of the myc-family) in addition to the 
loss of function of several tumour suppressor genes (such as p53 and Rb) have 

15 been found for more than 80% of SCLC tumours. These phenotypes are also found 
most cell lines deriving from SCLC tumours (reviewed in Frederiksen et al., 1999), 
allowing the cell lines to be used as an experimental tool for in vitro testing of poten- 
tial anticancer drugs. In addition, these cell lines can be propagated in vivo in nude 
mice, thus allowing testing of developed drugs an in vivo situation. Therefore cell 

20 lines derived from SCLC will be used for the initial screening of gene expression for 
identification of cancer specific (or highly expressed) surface molecules and regions 
with particular transcriptional activity (promoters) in SCLC cells. 

It is preferred that a variety of different SCLC cell lines established by different labo- 
25 ratories and from different patients are used with the present invention, in order to 
identify genes expressed in a large number of SCLCs. Furthemore, it is preferred 
that expression in these SCLC cell lines is compared with expression in a variety of 
normal tissues, which preferably is representative of different tissues of endodermal, 
ectodermal and mesodermal origin. 

30 

It is possible within the present invention to apply a biphasic strategy, however in 
certain embodiments of the. present invention other strategies may be applied. A 
biphasic strategy according to the present invention may for example be a gene 
therapy drug that via systemic administration can target cancer cells effectively 
35 through binding to functional, transport-competent receptors on the surface and 

which subsequently allows expression of the gene effectively in the cancer cells by a 
promoter, which is specifically active or hyperactive in the cancer cells. 
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Amino acids and nucleic acids 

Throughout the description and claims either the one letter code or the three letter 
5 code for natural amino acids are used. Where the L or D form has not been speci- 
fied it is to be understood that the amino acid in question has the natural L form, cf. 
Pure & Appl. Chem. Vol. (56(5) pp 595-624 (1984) or the D form, so that the pep- 
tides formed may be constituted of amino acids of L form, D form, or a sequence of 
mixed L forms and D forms. 

.10 

Where nothing is specified it is to be understood that the C-terminal amino acid of a 
polypeptide of the invention exists as the free carboxylic acid, this may also be 
specified as "-OH". The N-terminal amino acid of a polypeptide comprise a free 
amino^group, this may also be specified as "H-\ 

1 5 

Where nothing else is specified amino acid can be selected from any amino acid, 
whether naturally occurring or not, such as alpha amino acids, beta amino acids, 
and/or gamma amino acids. Accordingly, the group comprises but are not limited to: 
Ala, Val, Leu, He, Pro, Phe, Trp, M6t, Gly, Ser, Thr, Cys, Tyr, Asn, Gin, Asp, Glu, 
20 Lys, Arg, His, Aib, Nal, Sar, Orn, Lysine analogues DAP and DAPA. 

The term "nucleic acid" is meant to encompass DNA and RNA as well as derivatives 
thereof such as peptide nucleic acids (PNA) or locked nucleic acids (LNA) 
throughout the description. 



Methods to identify cell surface molecules and promoters 

The methods used to identify cell surface molecules and/or first nucleic acid 
sequences, which are capable of directing expression of second nucleic acid 
30 sequences operably linked thereto according to the present invention preferably 
involve the comparison of levels of mRNA found in malignant cell lines with the 
levels of mRNA found in normal tissues. 
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Preferably, the malignant cell lines according to the present invention are 
mammalian cell lines, more preferably human cell lines. Yet more preferably, the cell 
lines are derived from small cell lung carcinomas (SCLC). Even more preferably, the 
cell lines are selected from the group consisting of CPH 54 A, CPH 54 B, GLC 2, 
5 GLC 3, GLC 14, GLC 16, GLC 19, GLC 26, GLC 28, DMS 53, DMS 79, DMS 92, 

DMS 1 14, DMS 153, DMS 273, DMS 406, DMS 456, NCI H69, NCI N417, MAR H24 
and MAR 86 Ml, SHP-77, NCI-H2171, NCI-H2195, NCI-H2196, NCI-H2198, NCI- 
H2227, NCI-H2286, NCI-H2330, NCI-H735, NCI-H1339, NCI-H1963. NCI-H2107, 
NCI-H2108, NCI-H1304, NCI-H1341, NCI-H1417, NCI-H1436, NCI-H1522, NCI- 

10 H1618, NCI-H1672, NCI-H1694, NCI-H1836, NCI-H1870, NCI-H1876, NCI-H1882, 
NCI-H1926, NCI-H1930, NCI-H1994, NCI-H2029, NCI-H2059, NCI-H2066, NCI- 
H2081, NCI-H2141, NCI : H211. NCI-H220, NCI-H250, NCI-H524, NCI-H592, NCI- 
H711, NCI-H719. NCI-H740, NCI-H748. NCI-H774, NCI-H841. NCI-H847, NCI- 
H865, NCI-H1048, NCI-H1059, NCI-H1092, NCI-H1105, NCI-H1184, NCI-H1238, 

1 5 NCI-H 1 284, NCI-H1 688, NCI-H1 87, NCI-H378, NCI-H526, NCI-H660, NCI-H889, 
NCI-H60. NCI-H196, NCI-H446, NCI-H209, NCI-H146. NCI-H82, NCI-H460, NCI- 
H345, NCI-H510A, NCI-128, NCI-44 and SW 1271. More preferably, the cell lines 
are selected from the group consisting of CPH 54 A, CPH 54 B, GLC 2, GLC 3, GLC 
14, GLC 16, GLC 19, GLC 26. GLC 28, DMS 53. DMS 79. DMS 92, DMS 114, DMS 

20 1 53, DMS 273, DMS 406, DMS 456, NCI H69, NCI N41 7, MAR H24 and MAR 86 
Ml. 

Yet even more preferably the ceil lines are selected from the group consisting of 
CPH 54A, CPH 54 B, CHP 136A, GLC 2, GLC 3, GLC 14, GLC 16, GLC 19, GLC 
25 26, GLC 28, DMS 53, DMS 79, DMS 92, DMS 114, DMS 153, DMS 273, DMS 406, 
DMS 456, NCI-H69, NCI-N417, MAR H24 and MAR 86MI. 

Most preferably, the cell lines are selected from the group consisting of DMS 53, 
DMS 70, DMS 92, DMS 114, DMS 153, DMS 273, NCI 417 and NCI H69. 

30 



Preferred cell lines according to the present invention are listed in table 1 together 
with their accession numbers. 



WO 03/000928 



PCT/IB02/03534 



23 



Table 1 

5 Deposit Accession numbers of small ceil lung cancer cell lines 



SCLC 


Culture Col- 


(Provisional) 


Depositor 


cell line 


lection 


Accession no. 




CPH 54A 


ECACC 


01061905 




CPH 54B 


ECACC 


. 01061906 




GLC2 


ECACC 


01061907 




GLC 3 


ECACC 


01061908 




GLC 14 
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NCI-H2171 


ATTC 


CRL-5929 


NCI-H2195 




CRL-5931 


NCIN417 ! 




CRL-5809 


NCI-H2196 




CRL-5932 


NCI-H2198 




CRL-5933 


NCI-H2227 




CRL-5934 


NCI-H2286 




CRL-5938 


NCI-H2330 




CRL-5940 


NCI-H735 




CRL-5978 


NCI-H1339 




CRL-5979 


NCI-H1963 




CRL-5982 


NCI-H2107 




CRL-5983 


NCI-H2108 




CRL-5984 


NCI-H1304 




CRL-5862 


NCI-H1341 




CRL-5864 


NCI-H1417 
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NCI-H1882 
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CRL-5905 
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NCI-H524 




CRL-5831 




NCI-H592 




CRL-5832 
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CRL-5836 
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CRL-5846 
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CRL-5823 
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HTB-171 
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The methods for example involve at least 4, such as at least 5, for example at least 
6, such as at least 8, for example at least 10, such as at least 12, for example at 
least 14, such as at least 16, for example at least 18, such at least 20, for example 
21 , such at least 25, for example at least 30, such as at least 40, for example at 
5 least 50, such as at least 60, for example at least 70, such as around 79 malignant 
cell lines selected from the group consisting of CPH 54 A, CPH 54 B, GLC 2, GLC 3, 
GLC 14, GLC 16, GLC 19, GLC 26, GLC 28, DMS 53, DMS 79, DMS 92, DMS 114, 
DMS 153, DMS 273, DMS 406, DMS 456, NCI H69, NCI N417, MAR H24 and MAR 
86 Ml, SHP-77, NCI-H2171, NCI-H2195, NCI-H2196, NCI-H2198, NCI-H2227, NCI- 

10 H2286, NCI-H2330, NCI-H735, NCI-H1339, NCI-H1963, NCI-H2107, NCI-H2108, 
NCI-H1304, NCI-H1341, NCI-H1417, NCI-H1436, NCI-H1522, NCI-H1618, NCI- 
H1672, NCI-H1694, NCI-H1836, NCI-H1870, NCI-H1876, NCI-H1882, NCI-H1926, 
NCI-H1930, NCI-H1994, NCI-H2029, NCI-H2059, NCI-H2066, NCI-H2081, NCI- 
H2141, NCI-H211, NCI-H220, NCI-H250, NCI-H524/NCI-H592, NCI-H711, NCI- 

15 H719, NCI-H740, NCI-H748, NCI-H774, NCI-H841, NCI-H847, NCI-H865, NCI- 

H1048, NCI-H1059, NCI-H1092, NCI-H1105, NCI-H1184, NCI-H1238, NCI-H1284, 
NCI-H1688, NCI-H187, NCI-H378, NCI-H526, NCI-H660, NCI-H889, NCI-H60, NCI- 
HI 96, NCI-H446, NCI-H209, NCI-H146, NCI-H82; NCI-H460, NCI-H345, NCI- 
H510A, NCI-128, NCI-446 and SW 1271. 

20 

In one preferred embodiment of the invention the method involve all of the cell lines 
DMS 53, DMS 70, DMS 92, DMS 114, DMS 153, DMS 273, NCI 417 and NCI H69. 

The cell lines may be cultured by any suitable means, for example the cell lines may 
25 be cultured in an in vitro cell culture under suitable conditions known to the person 
skilled in the art. In one embodiment of the present invention, one or more cell lines 
are cultured in vivo in an animal as a xenograft. The animal may be any suitable 
animal, preferably a mammal, more preferably a rodent, most preferably a mouse. 
An example of how cell lines may be cultured in vivo as a xenograft is given in 
30 example 1. 

It is also comprised within the present invention that the same cell line may cultured 
in an in vitro cell culture and may be cultured in vivo. 
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In general, in vivo culture conditions i.e. culturing as a xenograft in an animal more 
closely resembles a natural occurring tumour or cancer and hence it is usually 
preferred that at least one, such as at least 2, for example at least 3, such as at 
least 4, for example at least 5, such as at least 6, for example at least 7, such as at 
5 least 8 cell lines are cultured in vivo. More preferably, in the range of 1 to 79, such 
as 2 to 70, for example 3 to 60, such as 4 to 50, for example 5 to 40, such as 6 to 
30, for example 7 to 20 cell lines are cultured in vivo. Even more preferably around 8 
cell lines are cultured in vivo. 

10 Preferably, the cell lines cultured in vivo are selected from the group of cell lines 
mentioned in table 1, even more preferably, the cell lines cultured in vivo are se- 
lected from the group consisting of CPH 54A, CHP 136A, GLC 3, GLC 14, DMS 
273, NCI-H69, NCI-N417 and MAR H24. 

15 Normal tissues are tissues, which are non-malignant. Preferably, such tissue is de- 
rived from an individual, which do not suffer from a premalignant and/or malignant 
condition. More preferably, the normal tissues are mammalian tissues, even more 
preferably, the tissues are human tissues. Yet more preferably, the tissues are se- 
lected from the group consisting of liver, heart, kidney, lung, adrenal gland, colon, 

20 pancreas, small intestine, spleen, skeletal, muscle, trachea, prostate, placenta, sali- 
vary gland, testes, leucocytes, leucocytes, brain, adipose tissue, bladder, breast, 
cervix, esophagus, larynx, ovary, rectum, skin, spinal cord, stomach, thymus, thyroid 
and uterus. Even more preferably, the tissues are selected from the group consist- 
ing of brain, adrenal gland, lung, kidney, heart, trachea, prostate, salivary gland, 

25 thyroid, liver, pancreas, spleen, small intestine, skeletal muscle, colon, stomach and 
testes. Most preferably the tissues are selected from the group consisting of lung, 
liver, heart and kidney. 

Preferably, the method involves at least 3, for example at least 4, such as at least 5, 
30 for example at least 6, such as at least 8, for example at least 10 total RNA 
samples. 

The method may be any method suitable to.compare the level of mRNA found in 
malignant cell lines with the levels of mRNA found in normal tissues known to the 
35 person skilled in the art. In general such method involves purification of either 



WO 03/000928 



PCT/IB02/03534 



28 



mRNA or total RNA. Purification of RNA may be performed according to any 
standard method known to the person skilled in the art as for example described in 



Sambrook et al, 1989 or herein below in the examples. 

5 The RNA samples may be compared by a number of different techniques. Any 

suitable technique may be applied with the present invention. For example the RNA 
samples can be compared by differential display or by subtractive hybridisation. 
Furthermore, techniques involving hybridisation of labelled RNA or cDNA pools with 
known nucleic acid sequences are suitable with this invention. The known nucleic 
10 acids may for example be immobilised on a solid support prior to hybridisation for 
example on a membrane/such as a nitrocellulose membrane, or the solid support 
may be of plastic or of glass. 

The labelled RNA or cDNA may be labelled with any directly or indirectly detectable 
15 label for example an enzyme, a radioactive isotope, chromophore, a fluorescent 

group or a heavy metal. Furthermore, the label may be one part of a pair of binding 
partners, wherein the second part is detectable, either directly or indirectly. For 
detection of an indirectly detectable label, it is possible to use a "sandwich" system, 
such as be one part of a pair of binding partners is recognised by the second part, 
20 which is in turn recognised by the first part, which may again be recognised by the 
second part. In every step the first and/or second part may be labelled. Examples of 
pairs of binding partners are antigen/antibodies or biotin/streptavidin. However, any 
other suitable pair can also be employed with the present invention. 

25 In one embodiment of the present invention the method comprises the steps of: 



35 



30 



v) 



i) 
ii) 



Hi) 
iv) 



Isolating RNA comprising mRNA from the malignant cell lines 
Preparing cDNA populations from said RNA, wherein one cDNA 
population is prepared from RNA isolated from one cell line or one 
tissue sample 

Labelling each cDNA population with a detectable label 
Providing solid supports on which an array of known nucleic acid 
sequences has been immobilised 

Incubating each cDNA population with a solid support under 
conditions which allows for hybridisation 
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vi) Detecting said detectable label on the solid supports 

Preferably said detectable label is an indirectly detectable label, more preferably the 
label is one part of a pair of binding partners, wherein the second part is detectable, 
5 either directly or indirectly. Most preferably the label is biotin. The biotin can be 
detected with a labelled streptavidin species, preferably a fluorescently labelled 
streptavidin. More preferably, the streptavidin may furthermore associate with an 
anti-streptavidin antibody labelled with biotin, which in turn may be detected by 
labelled streptavidin, preferably fluorescently labelled. The fluorescent label may for 
1 0 example be phycoerythrin or any other suitable fluorescent label. 

In one preferred embodiment the solid support is a glass plate. Preferably, at least 
1000, such as at least 5000, for example at least 10,000, such as at least 50.000, 
. such as at least 100.000, for example at least 150.000, such as at least 200.000, for 

1 5 example around 240.000 different known nucleic acid sequences are immobilised 

on the solid support. These nucleic acid sequences may all be derived from different 
genes, however, more preferably, each gene is represented by more than one, such 
as more than 2, for example more than 4, such as more than 7, for example more 
than 10, such as more than 15, for example more than 20, preferably around 20 

20 different nucleic acid sequences. 

In one embodiment of the present invention the RNA samples may be compared by 
a CHIPS analysis or a GeneChips analysis. The terms CHIPS analysis and 
GeneChips analysis are used interchangeably throughout the description. An 
25 example of how to perform a CHIPS analysis is given in example 1. 

Suitable cell surface molecules are selected according to several criteria. Preferably, 
there is a difference between the amount of mRNA expressed in one or more cell 
lines used in the method according to the present invention and the amount of 

30 mRNA expressed in one or more tissues according to the present invention. 

Preferably the difference is at least 1.1 fold, such as 1.2 fold, such as 1.5-fold, such 
as 1.75 fold, such as 2-fold, such as 2.5 fold, such as at least 3-fold, for example at 
least 4-fold, such as least 5-fold, for example at least 7.5 fold, such as least 10 fold 
difference in mRNA expression. In one preferred embodiment the difference is an in 

35 principle unlimited number of fold, such as there is no detectable mRNA expressed 
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in one or more cell lines and mRNA is detectable in one or more normal tissues, or 
there is no detectable mRNA expressed in one or more normal tissues and mRNA is 
detectable in one or more cell lines. 

5 Furthermore, there is preferably essentially no difference in the amount of mRNA 
expressed in at least two, such as at least 3, for example at least 4, such as at least 
5, for example at least 6, such as at least 8, for example at least 10, such as at least 
12, for example at least 14, such as at least 16, for example at least 18, such at 
least 20, for example 21, such at least 25, for example at least 30, such as at least 
10 40, for example at least 50, such as at least 60, for example at least 70, such as 
around 79 malignant cell lines used in the method according to the present 
invention. 

Additionally, there is preferably essentially no difference in the amount of mRNA 
1 5 expressed in at least two, such as at least 3, for example at least 4, such as at least 
5, for example at least 6, such as at least 8, for example at least 10 tissue samples, 
from which total RNA was used according to the methods of the present invention. 

Nucleic acid sequences encoding for potential cell surface molecules are selected 
20 from nucleic acid sequences that full fill the above criteria. In one particular preferred 
embodiment the potential cell surface molecules, are identified as outlined in 
example 1 and selected according to the criteria described in that example. 

To determine what nucleic acid sequences encode potential cell surface molecules 
25 different strategies may be employed. For example potential cell surface molecules 
may be selected according to information available in commonly accessible data- 
bases. Such databases may for example be selected from the group consisting of 
PubMed (NCBI), Nucleotide (NCBI), Protein (NCBI), Structure (NCBI), OMIM (NCBI) 
and LocusLink (NCBI). NCBI is the abbreviation for National Center for Biotechnol- 
30 ogy Information. Furthermore, potential cell surface molecules may be selected 

based on the presence of one or more of selected terms in name of the potential cell 
surface molecules. For example said terms may be selected from the group con- 
sisting of receptor, membrane, adhesion, integrin, surface, antigen, syndecan, 
transport, channel, hormone, binding, glycoprotein, matrix, CAM, desmosome, gap 
35 junction, delta, immunoglobulin, MHC, CD, HSPG, CSPG, integral and notch. 



WO 03/000928 



PCT/IB02/03534 



31 

Alternatively, nucleic acid sequences encoding for potential cell surface molecules 
are selected according to sequence homology with known cell surface molecules.. 
Nucleic acid sequences encoding potential cell surface molecules should have at 
5 least 20%, for example at least 22.5%, such as at least 25%, for example at 27.5%, 
such as at least 30% sequence identify with nucleic acid sequences encoding 
known cell surface molecules. 

The nucleic acid sequences encoding potential cell surface molecules may also be 
10 selected based on sequence homology with domains comprised within known cell 
surface molecules. Preferably, there is at least 20%, for example at least 22.5%, 
such as at least 25%, for example at 27.5%, such as at least 30%, for example at 
least 32.5%, such as at least 35%, for example at least 37.5%, such as at least 
40%, for example at least 42.5%, such as at least 45%, for example at least 47.5%, 
15 such as at least 50% sequence identify between domains of the nucleic acids 
encoding potential cell surface molecules and nucleic acid sequences encoding 
domains of known cell surface molecules. 

Nucleic acid sequences encoding potential cell surface molecules may also be 
20 selected based on that the potential cell surface molecules comprise a domain, 

which is often associated with the cell surface. Such a domain may for example be 
selected from the group consisting of hydrophobic regions and potential 
glycosylation sites. 

25 In one embodiment of the present invention candidate cell surface molecules have 
been identified by a Chips analysis. Suitable cell surface molecules may then be 
selected based on several criteria. For example cell surface molecules, which 
scored present (P) in the absolute call and with an Average difference of for 
example >10, such as >20, for example >40, such as >50 may be included. 

30 Furthermore, it is possible to make a point system to identify suitable cell surface 
molecules. For example a gene encoding a cell surface molecule may be set to 
score a number of points, such as one point for each cell line or tissue expressing 
the gene. The total scores for each gene may be summarised for normal tissue and 
the SCLC cell lines, respectively. Genes may then be selected, which were scored 

35 present in at least 3, such as 4, for example 5, such as 6, for example 7, such as 8, 
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for example 9, such as 10, for example more then 10 of the SCLC lines. A preferred 
method of selecting cell surface molecules is described in example 1 . 

The present invention also provides methods for identifying first nucleic acid 
5 sequences, which are capable of directing expression of second nucleic acid 
sequences operably linked thereto. These methods involves identifying second 
nucleic acid sequences, which are expressed at a level, which is different in 
malignant cells compared with rtormal cells. 

10 Accordingly, there is preferably a difference between the amount of second nucleic 
acid sequence mRNA expressed in one or more cell lines and the amount of second 
nucleic acid sequence mRNA expressed in one or more tissues. More preferably, 
the difference is at least 1.1 fold, such as 1.2 fold, such as- 1.5-fold, such as 1.75 
fold, such as 2-fold, such as 2.5 fold, such as at least 3-fold, for example at least 4- 

15 fold, such as least 5-fold, for example at least 7.5 fold, such as least 10 fold 
difference in mRNA expression. 

In one preferred embodiment the difference is an in principle unlimited number of 
fold, such as there is no detectable second nucleic sequence mRNA expressed in 
20 one or more cell lines and said mRNA is detectable in one or more normal tissues, 
or there is no detectable second nucleic acid sequence mRNA expressed in one or 
more normal tissues and said mRNA is detectable in one or more cell lines. 

Furthermore, there is preferably essentially no difference in the amount of second 
25 nucleic acid sequence mRNA expressed in at least two, such as at least 3, for 

example at least 4, such as at least 5, for example at least-6, such as at least 8, for 
example at least 10, such as at least 12, for example at least 14, such as at least 
16, for example at least 18, such at least 20, for example 21, such at least 25, for 
example at least 30, such as at least 40, for example at least 50, such as at least 
30 60, for example at least 70, such as around 79 malignant cell lines used with the 
methods of the present invention. 

Additionally, there is preferably essentially no difference in the amount of second 
. nucleic acid sequence mRNA expressed in at least two, such as at least 3, for 



WO 03/000928 



PCT/IB02/03534 



33 

example at least 4, such as at least 5 P for example at least 6, such as at least 8, for 
example at least 10 normal tissue samples. 

In one particularly preferred embodiment the second nucleic acid sequences are 
5 identified according to the method described in example 1 . Most preferably, the 
criteria outlined in that example are applied to select useful second nucleic acid 
sequences. 

In one embodiment of the present invention candidate promoters have been 

10 identified by a Chips analysis. Suitable promoters may then be selected based on 
several criteria based on expression level of the gene which the promoter controls. 
For example only genes, which scored present (P) in the absolute call and with an 
Average difference of >10, such as >20, for example >30, such as >40, for example 
>50 (level of expression) may be included included. A point scoring system as 

15 described herein above may be used. Genes that scored present in for example at 
least 3, such as at least 4, for example at least 5, such as at least 6, for example at 
least 7, such as at least 8, for example at least 9, such as at least 10, for example at 
least 11, such as at least 12 SCLC lines may for example be selected. If a gene 
scores present in one or more normal tissues, the median Average difference value 

20 of the SCLC cell lines should preferably be 4 times or more above the median 
Average difference value of the normal tissue. Preferably promoters with an 
Average differences of expression for normal tissues <50 and for SCLC>100 are 
selected. More preferably, promoters with an Average differences of expression for 
normal tissues <50 and for SCLC>200. Most preferably, promoters with an Average 

25 differences of expression for normal tissues <50 and for SCLC>400. Alternatively, 
promoters with an Average differences of expression in SCLC > 8 times higher than 
for normal tissue may be selected. . A preferred method of selecting cell surface 
molecules is described in example 1. 

Once second nucleic acid sequences have been identified according to the above 
30 mentioned criteria, it is possible to identify first nucleic acid sequences operably 
linked to the second nucleotide sequences. This can be done according to any 
standard method known to the person skilled in the art. For example it is possible to 
take advantage of known human genome sequences. 
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Cell surface molecules 

A cell surface molecule according to the present invention is any molecule naturally 
associated with the cell surface. Cell surface molecules may not be associated with 
5 the cell surface throughout their life time, but may be associated with the cell surface 
only at specific times. Cell surface molecules within the scope of the present inven- 
tion may be any kind of molecule associated with the cell surface, however the cell 
surface molecules according to the present invention preferably comprise a poly- 
peptide. 

10 

In one embodiment of the present invention cell surface molecules are not associ- 
ated with the cell surface, but rather are intracellular receptor. Intracellular receptors 
for example include nuclear steroid hormone receptors. 

15 However, in other preferred embodiments of the invention, cell surface molecules 
include for example molecules that are associated directly with the cell surface for 
example via a transmembrane domain, a membrane anchoring domain or a cova- 
lently linked group, which can associate with the membrane such as for example a 
lipophilic group. A lipophilic group may for example be a glycosyl-phosphatidyl- 

20 inositol group (GPI). However, it also includes molecules which are indirectly asso- 
ciated with the cell surface for example molecules that can associate with other cell 
surface molecules which are either directly or indirectly associated with the cell sur- 
face. 

25 In general a cell surface molecule comprise at least one extracellular domain, how- 
ever a cell surface molecule may comprise more than one extracellular domain such 
as 2, for example 3, such as 4 t for example 5, such as 6, for example 7, such as 8, 
for example 9, such as 10, for example more than 10 extracellular domains. 

30 Frequently, cell surface molecules are glycosylated polypeptides. 

In one preferred embodiment of the present invention, cell surface molecules are 
capable of associating with specific binding partners, and capable of internalising 
said specific binding partners upon association, i.e*. after association between bind- 
35 ing partner and cell surface molecule, the binding partner is transferred to the inte- 
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rior of the cell expressing the cell surface molecule. Frequently, the binding partner 
will be internalised by receptor mediated endocytosis, but other mechanisms are 
also possible and within the scope of the present invention. Cell surface molecules 
capable of internalising a binding partner may for example be useful for radio-, toxin- 
5 or gene therapy or cancer vaccines. 

In another preferred embodiment of the present invention, cell surface molecules 
are capable of associating with specific binding partners at the cell surface, but are 
not capable of internalising said specific binding partners. Non-internalising cell 
10 surface molecules may for example be useful for radio-therapy and cancer vaccines. 

Accession numbers from GenBank and names of preferred cell surface molecules 
are given in table 2. 
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Table 2 



Acces- 
sion 


Cell surface mole- 
cule 


Gene name 


SEQ ID 
(CDNA/ 
DNA) 


cert ir\ 
SbU ID 

(protein) 


M11507 


Transferrin receptor 


Human transferrin receptor mRNA, com- 
plete cds. 


1 


2 


X01060 


Transferrin receptor 


Human mRNA for transferrin receptor. 


3 


4 


MDU 1 JOJJ 


nr IU**o I 


norno sapiens mrciNH tor type n mem- 
brane protein, complete cds 
clone:HP10481. 


C 

0 


e 
O 




Pf^.n 


numan cnonoroiiin/aermaian suiiaie 
proteoglycan (PG40) core protein mRNA, 
complete cds. 


7 

f 


Q 

O 


AF064B01 


TRC8 


Homo sapiens multiple membrane span- 
ning receptor TRC8 (TRC8) mRNA, 
complete cd 


9 


10 


M29960 


TR2-11 


Human steroid receptor (TR2-11) mRNA, 
complete cds. 


11 


12 


X69398 


0A3 antigenic surface 
aeiBrTninani 


H.sapiens mRNA for OA3 antigenic sur- 
face determinant 


13 


14 


X53586 


Integrin alpha 6 


Human mRNA for integrin alpha 6. 


15 


16 


M 34480 


GPIIb 


Human platelet glycoprotein lib (GPIIb) 
mRNA, complete cds 


17 


18 


M14648 


Vitronectin receptor 
alpha subunit; also 
designated If GAV 


Human cell adhesion protein (vitronectin) 
receptor alpha subunit; ITGAV mRNA, 
complete cds. 


19 


20 


AF032108 


Integrin alpha-7 


Homo sapiens integrin alpha-7 mRNA, 


21 


22 
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complete cds. 






M35011 


Integrin beta- 5 subunit 


Human integrin beta-5 subunit mRNA, 
complete cds. 


23 


24 


X53002 




Human mRNA for integrin beta-5 subunit. 


25 


26 


L25851 


Integrin alpha E pre- 
cursor; also desig- 
nated ITGAE 


Homo sapiens integrin alpha E precursor; 
ITGAE, mRNA, complete cds. 


27 


28 


S66213 


Integrin alpha 6B 


Integrin alpha 6B [human, mRNA Partial, 
528 nt]. 


29 


30 


X06256 


Integrin alpha 5 sub- 
unit 


Human mRNA for integrin alpha 5 sub- 
unit. 


31 


32 


M59911 


Integrin alpha-3 chain 


Human integrin alpha-3 chain mRNA, 
complete cds! 


33 


34 


S59184 


RYK 


r\ i i\-iciaicu i\j i coepiui lyiuoiiic Miiaoc 

[human, hepatoma, mRNA, 3068 nt]. 




oo 


U50939 


Amvlniri oreeursor 
protein-binding protein 
1 


Humeri am\An\ri nrpfM ircnr nrntoin-hinrlinn 
nuuiaii cniiyiuiu piC/VsUioui \ji l/lcii i-um lull ly 

protein 1 mRNA, complete cds. 


^7 


JU 


U95822 


Putative transmem- 
brane GTPase 


Human putative transmembrane GTPase 
mRNA, partial cds. 


39 


40 


X59408 


Membrane cofactor 
protein 


H.sapiens, gene for Membrane cofactor 
protein. 


41 




L20859 


GLVR1 


Human leukemia virus receptor 1 
(GLVR1) mRNA, complete cds. 


42 


43 


D 64 154 


Mr 110,000 antigen 


Human mRNA for Mr 110,000. antigen, 
complete cds. 


44 


45 
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Inter-Alpha-Trypsin Inhibitor Heavy Chain 
LIKE gene 




47 


Z48199 


Syndecan-1 


H.sapiens syndecan-1 gene (exons 2-5). 


48 


49 


Y18007 


Putative seven trans- 
membrane domain 
protein 


Homo sapiens mRNA for putative seven 
transmembrane domain protein 


50 


51 


Y00815 


LCA-homolog/LAR 
protein 


Human mRNA for LCA-homolog. LAR 
protein (leukocyte antigen related) 


52 


53 


X64364 


M6 antigen 


H.sapiens mRNA for M6 antigen. 


54 


55 


X62654 


Me491/CD63 antigen 


H.sapiens gene for Me491/CD63 antigen. 


56 


57 


U94831 


Multispanning mem- 
brane protein 


Homo sapiens multispanning membrane 
protein mRNA, complete cds. 


58 


59 


U48705 


DDR 


Human receptor tyrosine kinase DDR 
gene, complete cds. 


60 


61 


M63175 


Autocrine motility 
factor receptor 


Human autocrine motility factor receptor 
mRNA. 


62 


63 


AB015631 


Type II membrane 
protein clone 


Homo sapiens mRNA for type II mem- 
brane protein, complete cds, 
clone:HP10390. 


64 


65 


Y00285 


Insuline-like growth 
factor II receptor 


Human mRNA for insuline-like growth 
factor II receptor. 


66 


67 
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U01160 


SAS 


Human transmembrane 4 superfamily 
protein (SAS) mRNA, complete cds 


68 


69 


M33680 


TAPA-1 


Human 26-kDa cell surface protein 
TAPA-1 mRNA, complete cds 


TA 

70 




M16941 


MHC class II HLA- 
coprotein beta-chain 


Human MHC class II HLA-DR7- 
associated glycoprotein beta-chain 
mRNA complete cds. 


72 


73 


J04599 


Bone small proteogly- 
can 1 biglycan 


Human hPGI mRNA encoding bone 
small proteoglycan I biglycan), complete 
cds 


74 


75 


Y07593 


CAR 


H.sapiens mRNA for 46 kDa coxsackievi- 
rus and adenovirus receptor (CAR) pro- 
tein. 


76 


77 


U73682 


MEA11 


Human meningioma-expressed antigen 
1 1 (MEA11) mRNA, partiel cds. 


78 


79 


U19247 


Interferon-gamma 
receptor alpha chain 


Homo sapiens interferon-gamma recep- 
tor alpha chain gene, exon 7 and com- 
plete cds. 


80 


81 


X73079 


Polymeric immuno- 
globulin receptor 


Homo sapiens encoding Polymeric im- 
munoglobulin receptor 


82 


83 


X80818 


Metabotropic gluta- 
mate receptor type 4 


H.sapiens mRNA for metabotropic glu- 
tamate receptor type 4. 


84 


85 


AF037339 


CLPTM1 


Homo sapiens cleft lip and palate trans- 
membrane protein 1 (CLPTM1) mRNA, 
complete cds. 


86 


87 


U10689 


MAGESa 


Human MAGE-5a antigen (MAGESa) 


88 


69 
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gene, complete cds. 






U03735 


MAGE-3 


Human MAGE-3 antigen (MAGE-3) 
gene, complete cds. 


90 


91 


M77481 


MAGE-1 


Human antigen (MAGE-1) gene, com- 
plete cds. 


92 


93 


U10691 


MAGE6 


Human MAGE-6 antigen (MAGE6) gene, 
complete cds. 


94 


95 


L33930 


CD24 


Homo sapiens CD24 signal transducer 
mRNA, complete cds and 3'region 


96 


97 


M 84349 


CD59 


Human transmembrane protein (CD59) 
gene, exon 4. 


98 


99 


L00352 


Low density lipoprotein 
receptor 


Human low density lipoprotein receptor 
gene, exon 18. 


100 


101 


AF023676 


Lamin B receptor 
homolog TM7SF2 


Homo sapiens lamin B receptor homolog 
TM7SF2 (TM7SF2) mRNA complete cds. 


102 


103 


U41804 


T1/ST2 receptor bind- 
ing protein precursor 


Human putative T1/ST2 receptor binding 
protein precursor mRNA, complete cds. 


104 


105 


Y10148 


NTR2 receptor 


H.sapiens mRNA for NTR2 receptor. 


106 


107 


U46194 


RAGE-4 


Human renal cell carcinoma antigen 
RAGE-4 mRNA, complete putative cds. 


108 


109 


M90683 


HLA-G1 


Human lymphocyte antigen (HLA-G1) 
mRNA, complete cds. 


110 


111 


AF1 04942 


MOAT-C 


Homo sapiens ABC transporter MOAT-C 
(MOAT-C) mRNA, complete cds. 


112 


113 
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AF042792 


Alpha 2 delta calcium 
channel subunit iso- 
form I 


Homo sapiens alpha 2 delta calcium 
channel subunit isoform I mRNA, 
compl.cds. 


114 


115 


Y00636 


LFA-3 


Human mRNA for lymphocyte function 
associateo anugen-o ^ltm-o;. 


116 


117 




LI -UAM 


H.sapiens mRNA for neural cell adhesion 

muiocuic l i 

(HSNCAML1) 


1 1 A 
I lo 


-HQ 


XM010168 


AVPR2 


Arginine-vasopressin receptor (AVPR2) 




120 




C1 p115C1 


C1 p115 




121 




TE2 


ARD1 N-acetyl transferase related pro- 
tein 




122 




RbP 


Renin binding protein 




123 




HCF-1 


Host cell factor 1 




124 




IRAK 


lnterleukin-1 -receptor associated kinase 




125 


D29963 


CD151 


Homo sapiens mRNA for CD151, com- 
plete cds. 


126 


127 


M60922 


Surface antigen 


Human surface antigen mRNA, complete 
cds. 


128 


129 


M29273 


MAG 


Human myelin-associated glycoprotein 
(MAG) mRNA, complete cds. 


130 


131 




Orn iy 


numan puiauve o proiein-coupiea re- 
ceptor (GPR19) gene, complete cds. 


132 


133 


L78132 


Pcta-1 


Human prostate carcinoma tumor antigen 
(pcta-1) mRNA, complete cds. 


134 


135 


U65011 


PRAME 


Human nreferentiallv expressed antiaen 
of melanoma (PRAME) mRNA, 
compl.cds. 


136 


137 


X81882 


Vasopressin activated 
calcium mobilizing 
receptor-like protein 


H saniens mRNA for vasooressin acti- 
vated calcium mobilizing receptor-like 
protein. 


138 


139 


U65416 


MICB 


Human MHC class 1 molecule (MICB) 
gene, complete cds. 


140 


141 


Y12505 


Serotonin receptor 5- 
HT4B 


H.sapiens mRNA for serotonin receptor 
5-HT4B, splice variant 


142 


143 
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WI3OD90 


CD9 


Human CD9 antigen mKNA, complete 
cds. 


144 


145 


AFQ77820 . 


LDL receptor member 
LR3 


Homo sapiens LDL receptor member 
LR3 mRNA, complete cds. 


146 


147 


1 HACOD 

UiOooo 


KAAr*C At* 

MAGc-40 


Human MAGc-4b antigen (MA\jb4D) 
gene, complete cds. 


148 


149 


A CA£OOCD 

Ah U boo bo 


DR6 


Homo sapiens TNFR-related death re- 
ceptor-6 (DR6) mRNA, complete cds. 


150 


151 


D 16532 


Very low density lipo- 
protein receptor 


Human gene for very low density lipo- 
protein receptor, exon 19 


152 


153 


■ 1Q4 eft A 

M8159Q 


Serotonin 1D receptor 
(5-HT1D-) 


Homo sapiens serotonin 1D receptor (5- 
HT1D-) mRNA, complete cds 


154 


155 


M582B6 


Tumour necrosis factor 
receptor 


Homo sapiens tumor necrosis factor re- 
ceptor mRNA, complete cds. 


156 


157 


AF062006 


HG38 


Homo sapiens orphan G protein-coupled 
receptor HG38 mRNA, complete cds 


158 


159 


1 IAAA4T 

U09937 


Urokinase-type plas- 
minogen receptor 


Human urokinase-type plasminogen re- 
ceptor, exon 7. 


160 


161 


M 22092 


N-CAM; also desig- 
nated NCAM1 


Human neural cell adhesion molecule (N- 
CAM) gene, exon SEC and partial cds. 


162 


163 


M34641 


FGF receptor 


Human fibroblast growth factor (FGF) 
receptor-1 mRNA, complete cds. 


164 


165 


M 14764 


Nerve growth factor 
receptor 


Human nerve growth factor receptor 
mRNA, complete cds. 


166 


167 


114 ACA4 

U 10694 


MAGt-9 


Human MAGE-9 antigen (MAGE9) gene, 
complete cds. 


168 


169 


AB026891 


Cystine/glutamate 
transporter 


Homo sapiens mRNA for cys- 
tine/glutamate transporter, complete cds. 


170 


171 


U73304 


CB1 cannabinoid 
receptor (CNR1) 


Human . CB1 cannabinoid receptor 
(CNR1) gene, complete cds. 


172 


173 






numan pregnancy- specific ueia- 1 glyco- 
protein (PSG) mRNA, comple cds 


174 


175 






nomo sapiens nurt-r\ mr\iNM ior ort* 
receptor, complete cds. 


176 


177 


AF011406 


CRH2R 


Homo ^aniens nortif^otronin rp|pa<?ino 

hormone receptor type 2 beta isoform 
(CRH2R) mRNA, complete cds. 


178 


179 


U50410 


OCI5 


Human heparan sulphate proteoglycan 
(OCI5) mRNA,complete cds 


180 


181 


AF016266 


TRAIL receptor 2 


Homo sapiens TRAIL receptor 2 mRNA, 
complete cds. 


182 


183 
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U 87947 




numan nemaiopoienc neurai rneiMordiie 
protein (HNMP-1) mRNA, complete cds. 


184 


185 




adrenergic receptor 


numan Kioney aipna-z-aarenergic rt?- 
ceptor mRNA, complete cds. 


186 


187 




IVIMVjC I I 


Unman MAPC 11 ontinon Af^P1 1 \ 

numan iviAbt-i i anugen ^ivihociij 
gene, complete cds. 


188 


189 


U25988 


PSG13' 


Human pregnancy-specific glycoprotein 
13 (PSG13*) mRNA, complete cds 


190 


191 


M60459 


Erythropoietin receptor 


Human erythropoietin receptor mRNA, 
complete cds. 


192 


193 


X15998 


Chondroitin sulphate 
proteoglycan versican 
V1 


H.sapiens mRNA for the chondroitin sul- 
phate proteoglycan versican V1 splice- 
variant; precursor peptide. 


194 


195 


U31216 


mGlulbeta 


Human metabotropic glutamate receptor 
1 beta (mGluRlbeta) mRNA, complete 
cds. 


196 


197 




CHQ7 


n. sapiens ouy/ gene exon i ^anu joineu 
CDS). 


198 


199 


M90657 


L6 


Human tumor antigen (L6) mRNA, com- 
nlete cds 


200 


201 


U87459 


NY-ESO-1 


Human autoimmunogenic cancer/testis 
antigen NY-ESO-1 mRNA, complete cds. 


202 


203 


S71824 


N-CAM; also desig- 
nated NCAM1 


N-CAM=145 kda neural cell adhesion 
molecule [human, small cell lung cancer 
cell 

line OS2-R, mRNA, 2960 nt]. 


204 


205 


AE000659 


T-cell receptor alpha 
delta 


Homo sapiens T-cell receptor alpha delta 
locus from bases 250472 to 501670 
(section 2 of 5) of the Complete Nucleo- 
tide Sequence. 


206 


207 

208 
209 
210 
211 
212 
213 
214 
215 
216 
217 
218 
219 
220 
221 
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222 


1119 r UJJ 




Human francmomhranP reCPDtor (mr2\ 

mRNA, complete cds. 


223 


224 


M81830 


SSTR2 


Human somatostatin receptor isoform 2 
(SSTR2) gene, complete cds 


225 


226 


AF030339 


VESPR 


Homo sapiens receptor for viral sema- 
phore protein (VESPR) mRNA, complete 
cds. 


227 


228 


X02160 


insulin receptor 
precursor 


Unman mRNA for iriQiilin rpppntor ore- 

cursor. 


229 


230 


U12255 


IgG Fc receptor 


Human IgG Fc receptor hFcRn mRNA, 
complete cds. 


231 


232 


X82068 


Glutamate receptor 
subunit GluRC 


H.sapiens mRNA for glutamate receptor 
subunit GluRC. 


233 


234 


X75208 


HEK2 


H.sapiens HEK2 mRNA for protein tyro- 
sine kinase receptor. 


235 


236 


X64116 


PVR 


H.sapiens PVR gene for poliovirus re- 
ceptor (exon 1), 


237 








poliovirus receptor y 




238 






poliovirus receptor (} 




239 






poliovirus receptor a 




240 


M 29 540 


CEA 


Human carcinoembryonic antigen mRNA 
(CEA), complete cds. 


241 


242 


U94888 


CC-chemokine-binding 
receptor JAB61 


Homo sapiens CC-chemokine-binding 
receptor JAB61 mRNA, complete cds. 


243 


244 


M97675 


Ror1 


Human transmembrane receptor (rorl) 
mRNA, complete cds. 


245 


246 


M 12036 


HER2 


Human tyrosine kinase-type receptor 
(HER2) gene, partial cds. 


247 


248 


1 IR7 AAA 


r^uiaiive enaoineiin 
receptor type B-like 
protein 


Human putative endothelin receptor type 
B-like protein mRNA, complete cds. 


249 


250 


L20852 


GLVR2 


Unman loiikomia win ic rorDntnr 0 

nuiiidn itjuivtsniid viruo rtiOGjJiui c. 

(GLVR2) mRNA, complete cds. 


251 


252 I 


L05424 




Human cell surface glycoprotein CD44 
(CD44) gene, 3' end of long tailed iso- 
form 


253 


254 


M59040 


CD44 


Human cell adhesion molecule (CD44) 
mRNA, complete cds 


255 


256 
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U83993 


P2X4 purinoreceptor 


Human P2X4 purinoreceptor mRNA, 
complete cds. 


257 


258 


XMJ>15664 


FLJ22357 similar to 
epidermal growth 
factor-related protein 


Homo sapiens hypothetical protein 
FLJ22357 similar to Epidermal growth 
factor receptor-related protein 
(FLJ22357) mRNA. 


261 


262 


NMJJ22450 




Homo sapiens hypothetical protein 
FLJ22357 similar to Epidermal growth 
factor receptor-related protein 
(FLJ22357) mRNA. 


259 


260 




CDDI 1 

rrrsLI 


numan lormyi pepuue recepiur-iiKe re- 
ceptor (FPRL1) mRNA, complete cds 


263 


264 


M 34309 


HER3 


Human epidermal growth factor receptor 
(HER3) mRNA, complete cds. 


265 


266 . 


M83664 


H LA-DP 


Human MHC class II IvmDhocvte antioen 
(HLA-DP) beta chain mRNA, compl.cds. 


267 


268 


AF025998 


Atrial natriuretic D€D- 
tide clearance receptor 


Homo saDiens atrial natriuretic DeDtide 
clearance receptor (AN PRC) mRNA, 
complete cds. 


269 


270 


via nncni>t 
AW1_UU0U«»4 


bastrin/uuK-b re- 
ceptor 


nomo sapiens cnoiecysiOKinin d receptor 
(CCKBR), mRNA. 


271 


272 




Neuromedin B re- 
ceptor 


numan neuromeam o recepior ^rMiviD-rcj 
mRNA, complete cds. 


273 


274 


NM_001496 


GFRA3 


Homo sapiens GDNF family receptor 
alpha 3 (GFRA3), mRNA. 


275 


276 


XM_010317 


GRPR 


Homo sapiens gastrin-releasing peptide 
receptor (GRPR), mRNA. 


277 


278 


U92459 


Metabotropic gluta- 
mate receptor 8; also 
designated GRM8 or 
GluR8 


Human metabotropic glutamate receptor 
8 (GluR8) mRNA, complete cds.; GRM8 


279 


280 


XMJ>07840 


CDH1 


Homo sapiens cadherin 1, type 1, E- 
cadherin fenithelialVCDHI} mRNA 


281 


282 


XMJH6157 


CDH2 


Homo sapiens cadherin 2, type 1, N- 
cadherin (neuronal)(CDH2),mRNA. 


283 


284 


XMJ)05591 


TGFBR1 


Homo sapiens transforming growth fac- 
tor, beta receptor I (activin A receptor 
type ll-like kinase, 53kD) (TGFBR1), 
mRNA. 


285 


286 


XMJ>03094 


TGFBR2 


Homo sapiens transforming growth fac- 
tor, beta receptor II (70-80kD) (TGFBR2), 
mRNA. 


287 


288 


XM_001924 


TGFBR3 


Homo sapiens transforming growth fac- 
tor, beta receptor III (betaglycan, 
300kD)(TGFBR3), mRNA. 


289 


290 


NM_000875 


IGF1R 


Homo sapiens insulin-like growth factor 1 
receptor (IGFlR), mRNA. 


291 


292 
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X00588 


Precursor of eDidermal 
growth factor receptor 


Human mRNA fnr nrpfiir^nr nf pnirlf*rmal 

growth factor receptor. 


293 


294 


Z75190 


LRP8 


Homo sapiens apolipoprotein E receptor 
2 (AP0ER2), also designated LRP8; 
mRNA 


295 


296 


U62434 


CHRNA 5 


Nicotinic acetylcholine receptor alpha5 
subunit (CHRNA 5); mRNA 


297 


298 


U19878 


TMEFF1 


Transmembrane protein with EGF-like 
and two follastyatin-like domains 1 
(TMEFF1); mRNA 


299 


300 


L20814 


GRIA2; also desig- 
nated GluR2 


Human glutamate receptor 2 (HBGR2); 
also designated GluR2 or GRIA2; com- 
plete coding sequence 


301 


302 


AL008583 


NPTXR 


Neuronal pentraxin receptor (NPTXR); 
DNA sequence 


303 


304 



Even more preferred cell surface molecules according to the present invention are 
5 receptors which belong to one of the following groups: 

Members of receptor tyrosine kinases 
Members of the integrin family 

Members of the immunoglobulin superfamily adhesion molecules 
10 Members of the heparan sulfate proteoglycan family 

Members of the chondroitin sulfate proteoglycan family 

Members of the MAGE family 

Members of the RAGE family 

Members of the low density lipoprotein receptor family 
1 5 Members of the cadherin adhesion molecules 

Members of the metabotropic glutamate receptors 

Members of the steroid hormone families 

Members of the seven transmembrane receptor family 

Atrial natriuretic peptide clearance receptor 
20 GFRA3 

Transferrin receptor 

Members of the serine/threonine kinase receptors 

Yet more preferred cell surface molecules according to the present invention are cell 
25 surface molecules selected from the group consisting of NCAM1 , NPTXR, LRP8, 
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CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, TNFRSF12, L1CAM, GPR49 and 
TMEFFT. 

In one preferred embodiment of the present invention, cell surface molecules are 
5 capable of internalising specific binding partners upon association (see herein 

above). Preferred cell surface molecules according to the present invention capable 
of internalising a binding partner may be selected from the group consisting of 
NCAM1 , NPTXR, LRP8 and CHRNA5. 

1 0 In another preferred embodiment of the present invention, cell surface molecules 

are not capable of internalising their specific binding partners. Preferred cell surface 
molecules according to the present invention not capable of internalising binding 
partner(s) may be selected from the group consisting of GRIA2, GRM8, ITGAV and 
ITGAE 

15 

One especially preferred cell surface molecule according to the present invention is 
NCAM1 (NCAM, neural cell adhesion molecule, N-CAM, CD56)._NCAM1 is highly 
expressed in most SCLC. The expression of NCAM1 has been shown by for exam- 
ple CHIPS analysis, RT-PCR and western blotting (see example 1, figure 1 1 and 

20 figure 15). NCAM1 is capable if internalising binding partners. Binding partners ca- 
pable of associating with NCAM1 are described herein below. NCAM1 has been 
shown to be internalised in astrocytes (Minana et a/., 2001) and has been demon- 
strated capable of very efficient internalisation of a very large molecule complex 
consisting of: anti-NCAM1 antibody-Protein A-Streptavidine-Biotin-p-galactosidase 

25 enzyme ( Yu et a/. , 2000). 

Another preferred cell surface molecule according to the present invention is 
NPTXR (Neuronal pentraxin receptor , NPR). NPTXR is expressed in most SCLC. 
The expression of NPTXR has been demonstrated by for example CHIPS analysis, 
30 RT-PCR and western blotting (see example 1, figure 10 and figure 14). NPTXR is a 
highly internalising receptor. Binding partners capable of associating with NPTXR 
are described herein below. 



35 



Yet another preferred cell surface molecule according to the present invention is 
LRP8 (low density lipoprotein receptor related protein, apolipoprotein E receptor 2). 
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LRP8 is expressed in all tested SCLC. The expression of LRP8 has been demon- 
strated by for example CHIPS analysis and RT-PCR (see example 1 and figure 9). 
LRP8 is a highly internalising receptor. Binding partners capable of associating with 
LRP8 are described herein below. 

5 

Yet another preferred cell surface molecule according to the present invention is 
CHRNA5 (nicotinic acetylcholine receptor alphas subunit). Binding partners capable 
of associating with CHRNA5 are described herein below. 

10 Yet another cell surface molecule that may be used with the present invention is 
L1CAM (neural cell adhesion molecule L1). L1 is known to be able to internalise 
binding partners. Binding partners capable of associating with L1CAM are described 
herein below. 

15 Yet another preferred cell surface molecule according to the present invention is 
TNFRSF12 (DR6, tumor necrosis factor receptor superfamily member 21). 
TNFRSF12 is expressed in most SCLC. The expression of TNFRSF12 has been 
' demonstrated by for example CHIPS analysis and RT-PCR (see example 1 and 
figure 8). Other members of the family to which TNFRSF12 belong are capable of 

20 internalising binding partners. 

An especially preferred cell surface molecule according to the present invention is 
GRIA2 (lonotropic glutamate receptor 2, GLUR2, GLURB, HBGR2, AMPA 2). 
GRIA2 is expressed in all tested SCLC and in the brain. GRIA2 is a highly specific 
25 SCLC receptor outside the brain. Expression of GRIA2 has for example been dem- 
onstrated by Chips analysis, RT-PCR and Western blotting (see example 1, figure 
12A and figure 16). 

Another preferred cell surface molecule according to the present invention is GRM8 
30 (metabotropic glutamate 8 receptor, GLUR8, mGlu8, GPRC1HU3RM8 is highly 

specifically expressed in SCLC except for the brain and is expressed in most SCLC. 
The expression of GRM8 has been demonstrated by for example Chips analysis, 
RT-PCR and western blotting (see example 1 and figure 13). Binding partners ca- 
pable of associating with GRM8 are described herein below. 

35 
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Yet another preferred cell surface molecule according to the invention is ITGAV (In- 
tegrin subunit ocv, vitronectin receptor, CD 51). itGAV is highly expressed by SCLC. 
Expression has been demonstrated by for example CHIPS analysis and RT-PCR 
(see example 1 and figure 12B) 

5 

Yet another preferred cell surface molecule according to the present invention is 
ITGAE (integrin aE subunit-precursor, human mucosal lymphocyte-1'antigen, CD 
103). ITGAE is expressed by all SCLC tested and ITGAE is highly specifically ex- 
pressed in SCLC. Expression has been demonstrated by for example CHIPS analy- 
10 .sis and western blotting (see example 1 and figure 17). 

Yet another preferred cell surface molecule according to the present invention is 
GPR49 (orphan G protein-coupled receptor 67, GPR67, HG38). 

15 Yet another preferred cell surface molecule according to the present invention is 
TMEFF1 (transmembrane protein with EGF-like and two follastatin-like domains). 

However, the present invention is also directed towards cell surface molecules 
which comprises fragments, which are encoded by fragments of the nucleotide se- 

20 quences given in table 2. In one preferred embodiment, the present invention is di- 
rected towards cell surface molecules encoded by splice variants of these se- 
quences, which are encoded by the same gene. Splice variants of cell surface 
molecules outlined in table 2 may encode a polypeptide sequence which share 
fragments with said cell surface molecules; however splice variants may take ad- 

25 vantage of an alternative reading frame, so that although the products of the two 

splice variants are encoded by nucleotide sequences that share common fragments, 
the polypeptide sequences may not be related. 

# 

Furthermore, the present invention is directed to fragments of the nucleotide se- 
30 quences encoding cell surface molecules according to table 2. In particular, binding 
partners according to the present invention (se herein below) may associate with 
products of only one or more fragments of a cell surface molecule according to the 
present invention, but preferably not with all fragments of a cell surface molecule. 
Accordingly, it is possible to use fragments of the cell surface molecules to identify 
35 potential binding partners (see herein below). 
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For example such fragments comprise the 5' half of the sequence or the 3' half of 
the sequences. Furthermore, the fragments may comprise part of the 5' half or part 
of the 3' half of the sequences. Preferably, such fragments are shorter than 5000 
5 bp, such as shorter than 4000 bp, for example shorter than 3000 bp, such as shorter 
than 2500 bp, for example shorter than 2000 bp, such as shorter than 1750 bp, such 
as shorter than 1500 bp, for example shorter than 1250 bp, such as shorter than 
1000 bp, for example shorter than 900 bp, such as shorter than 800 bp, for example 
shorter than 700 bp, such as shorter than 600 bp, for example shorter than 500 bp, 
10 such as shorter than 400 bp, for example shorter than 300 bp, such as shorter than 
200 bp, for example shorter than 100 bp, such as shorter than 75 bp, for example 
shorter than 50 bp, such as shorter than 40 bp, for example shorter than 30 bp, such 
as shorter than 25 bp, for example shorter than 20 bp, such as shorter than 18 bp. 

1 5 Such fragments may be internal fragments or they may be comprise the 5' or the 3' 
terminal. 

In one preferred embodiment of the present invention the fragments comprise a plu- 
rality of building blocks of a predetermined length and wherein the building blocks 
20 are linked so that the fragments are identical to part of a native gene sequence, 

preferably the sequences outlined in table 2. Accordingly, fragments may comprise 
a plurality of building blocks of the predetermined length and a predetermined start- 
ing point. 

25 Building blocks are nucleic acid sequences, which have a predetermined length and 
starting point, so that the first building block starts at a given position in the nucleic 
acid sequence and the subsequent building blocks starts at the position following 
the position where the previous building block stops. 

30 Preferably, the building blocks are derived from any of the sequences SEQ ID NO. 
1, SEQ ID NO. 3, SEQ ID NO. 5, SEQ ID NO. 7, SEQ ID NO. 9, SEQ ID NO. 11, 
SEQ ID NO. 13, SEQ ID NO. 15, SEQ ID NO. 17, SEQ ID NO. 19, SEQ ID NO. 21, 
SEQ ID NO. 23, SEQ JD NO. 25, SEQ ID NO. 27, 29, SEQ ID NO. 31, SEQ ID NO. 
33, SEQ ID NO. 35, SEQ ID NO. 37, SEQ ID NO. 39, SEQ ID NO. 41, SEQ ID NO. 

35 42, SEQ ID NO. 44, SEQ ID NO. 48, SEQ ID NO. 50, SEQ ID NO. 52, SEQ ID NO. 
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54, SEQ ID NO. 56, SEQ ID NO. 58, SEQ ID NO. 60, SEQ ID NO. 62, SEQ ID NO. 
64, SEQ ID NO. 66, SEQ ID NO. 68, SEQ ID NO. 70, SEQ ID NO. 72, SEQ ID NO. 
74, SEQ ID NO. 76, SEQ ID NO. 78, SEQ ID NO. 80, SEQ ID NO. 82, SEQ ID NO. 
84, SEQ ID NO. 86, SEQ ID NO. 88, SEQ ID NO. 90. SEQ ID NO. 92, SEQ ID NO. 
5 94, SEQ ID NO. 96, SEQ ID NO. 98, SEQ ID NO. 100, SEQ ID NO. 102, SEQ ID 
NO. 104, SEQ ID NO. 106, SEQ ID NO. 108, SEQ ID NO. 110, SEQ ID NO. 112, 
SEQ ID NO. 114, SEQ ID NO. 116, SEQ ID NO. 118, SEQ ID NO. 126, SEQ ID NO. 
128, SEQ ID NO. 130, SEQ ID NO. 132, SEQ ID NO. 134, SEQ ID NO. 136, SEQ 
ID NO. 138, SEQ ID NO. 140, SEQ ID NO. 142, SEQ ID NO. 144, SEQ ID NO. 146, 

10 SEQ ID NO. 148, SEQ ID NO. 150, SEQ ID NO. 152, SEQ ID NO. 154, SEQ ID NO. 
156, SEQ ID NO. 158, SEQ ID NO. 160, SEQ ID NO. 162, SEQ ID NO.164, SEQ ID 
NO. 166, SEQ ID NO. 168, SEQ ID NO. 170, SEQ ID NO. 172, SEQ ID NO. 174, 
SEQ ID NO. 176, SEQ ID NQ. 178, SEQ ID NO. 180, SEQ ID NO. 182, SEQ ID NO. 
184, SEQ ID NO. 186, SEQ ID NO. 188, SEQ ID NO. 190, SEQ ID NO. 192, SEQ 

15 ID NO. 194. SEQ ID NO. 196, SEQ ID NO. 198, SEQ ID NO. 200. SEQ ID NO. 202. 
SEQ ID NO. 204, SEQ ID NO. 206. SEQ ID NO. 223. SEQ ID NO. 225, SEQ ID NO. 
227, SEQ ID NO. 229, SEQ ID NO. 231. SEQ ID NO. 233, SEQ ID NO. 235, SEQ 
ID NO. 237, SEQ ID NO. 241, SEQ ID NO. 243. SEQ ID NO. 245. SEQ ID NO. 247, 
SEQ ID NO. 249, SEQ ID NO. 251, SEQ ID NO. 253. SEQ ID NO. 255, SEQ ID NO. 

20 257. SEQ ID NO. 261 , SEQ ID NO. 259, SEQ ID NO. 263, SEQ ID NO. 265, SEQ 
ID NO. 267, SEQ ID NO. 269, SEQ ID NO. 271. SEQ ID NO. 273, SEQ ID NO. 275, 
SEQ ID NO. 277, SEQ ID NO. 279, SEQ ID NO. 281. SEQ ID NO. 283, SEQ ID NO. 
285, SEQ ID NO. 287, SEQ ID NO. 289. SEQ ID. NO. 291. SEQ ID NO. 293, SEQ 
ID NO. 295, SEQ ID N0.297, SEQ ID NO.299, SEQ ID NO.301 and SEQ ID 

25 NO.303. 

Each building block is preferably shorter than 1000 bp, for example shorter than 900 
bp, such as shorter than 800 bp, for example shorter than 700 bp, such as shorter 
than 600 bp, for example shorter than 500 bp, such as shorter than 400 bp, for ex- 
30 ample shorter than 300 bp, such as shorter than 200 bp, for example shorter than 
100 bp, such as shorter than 75 bp, for example shorter than 50 bp, such as shorter 
than 40 bp, for example shorter than 30 bp, such as shorter than 25 bp, for example 
shorter than 20 bp, such as shorter than 18 bp. In one embodiment the building 
block is around 18 bp. 



35 
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The building blocks may start at position 1, such as position 2, for example position 
3, such as position 4, for example position 5, such as position 6, for example posi- 
tion 7, such as position 8 f for example position 9, such as position 10, for example 
position 11, such as position 12, for example position 13, such as position 14, for 
5 example position 15, such as position 16, for example position 17, such as position 
18, for example position 19, such as position 20, such as any of the positions 20 to 
100, for example any of the position 100 of any of the sequences outlined in table 2. 

The fragments preferably comprise a plurality of building blocks, such as 2, for ex- 
10 ample 3, such as 4, for example 5, such as from 5 to 10, for example from 10 to 20, 
such as from 20 to 30, for example from 30 to 40, such as from 40 to 50, for exam- 
ple from 50 to 75, such as from 75 to 100, for example more than 100 building 
blocks. 

15 In one embodiment the fragments comprise building blocks which are 100 base 
pairs long and which start at position 1 . 

Furthermore, fragments of cell surface molecules according to the present invention 
may be chimeric fragments, such chimeric fragments comprise more than one frag- 
20 ments which are not associated with each other according to the sequences outlined 
in table 2. Such chimeric fragments may comprise fragments from the same cell 
surface molecule or they may contain fragments from more than one cell surface 
molecule according to the invention. 

25 Furthermore, the present invention is directed to fragments of the polypeptides se- 
quences of cell surface molecules according to table 2. In particular, binding part- 
ners according to the present invention (se herein below) may associate with only 
one or more fragments of a cell surface molecule according to the present invention, 
but preferably not with all fragments of a cell surface molecule. Accordingly, it is 

30 possible to use fragments of the cell surface molecules to identify potential binding 
partners (see herein below). 

Fragments of polypeptide sequences may be shorter than 3000 amino acids, such 
as shorter than 2500 amino acids, for example shorter than 2000 amino .acids, such 
35 as shorter than 1750 amino acids, such as shorter than 1500 amino acids, for ex- 
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ample shorter than 1250 amino acids, such as shorter than 1000 amino acids, for 
example shorter than 900 amino acids, such as shorter than 800 amino acids, for 
example shorter than 700 amino acids, such as shorter than 600 amino acids, for 
example shorter than 500 amino acids, such as shorter than 400 amino acids, for 
5 example shorter than 300 amino acids, such as shorter than 200 amino acids, for 
example shorter than 100 amino acids, such as shorter than 75 amino acids, for 
example shorter than 50 amino acids, such as shorter than 40 amino acids, for ex- 
ample shorter than 30 amino acids, such as shorter than 25 amino acids, for exam- 
ple shorter than 20 amino acids, such as shorter than 15 amino acids, for example 
10 shorter than 10 amino acids. 

Preferably, the fragments are fragments of polypeptide sequences SEQ ID NO. 2, 
SEQ ID NO. 4, SEQ ID NO. 6, SEQ ID NO. 8, SEQ ID NO. 10, SEQ ID NO. 12, 
SEQ ID NO. 14, SEQ ID NO. 16. SEQ ID NO. 18, SEQ ID NO. 20, SEQ ID NO. 22, 

15 SEQ ID NO. 24, SEQ ID NO. 26. SEQ ID NO. 28, SEQ ID NO. 30, SEQ ID NO. 32, 
SEQ ID NO. 34, SEQ ID NO. 36, SEQ ID NO. 38, SEQ ID NO. 40, SEQ ID NO. 43, 
SEQ ID NO. 45. SEQ ID NO. 47, SEQ ID NO. 49, SEQ ID NO. 51, SEQ ID NO. 53, 
SEQ ID NO. 55. SEQ ID NO. 57, SEQ ID NO. 59. SEQ ID NO. 61. SEQ ID NO. 63, 
SEQ ID NO. 65, SEQ ID NO. 67, SEQ ID NO. 69, SEQ ID NO. 71, SEQ ID NO. 73, 

20 SEQ ID NO. 75, SEQ ID NO. 77. SEQ ID NO. 79. SEQ ID NO. 81 . SEQ ID NO. 83, 
SEQ ID NO. 85. SEQ ID NO. 87. SEQ ID NO. 89. SEQ ID NO. 91. SEQ ID NO. 93, 
SEQ ID NO. 95, SEQ ID NO. 97, SEQ ID NO. 99, SEQ ID NO. 101, SEQ ID NO. 
103, SEQ ID NO. 105, SEQ ID NO. 107, SEQ ID NO. 109, SEQ ID NO. 1 1 1 , SEQ 
ID NO. 113, SEQ ID NO. 115, SEQ iD NO. 117, SEQ ID NO. 119. SEQ ID NO. 120, 

25 SEQ ID NO. 121, SEQ ID NO. 122, SEQ ID NO. 123, SEQ ID NO. 124 SEQ ID NO. 
125, SEQ ID NO. 126, SEQ ID NO. 127, SEQ ID NO. 129, SEQ ID NO. 131, SEQ 
ID NO. 133. SEQ ID NO. 135, SEQ ID NO. 137, SEQ ID NO. 139, SEQ ID NO. 141, 
SEQ ID NO. 143, SEQ ID NO. 146, SEQ ID NO.147. SEQ ID NO. 149, SEQ ID NO. 
151, SEQ ID NO. 153, SEQ ID NO. 155, SEQ ID NO. 157, SEQ ID NO. 159, SEQ 

30 ID NO. 161. SEQ ID NO. 163, SEQ ID NO. 165, SEQ ID NO. 167. SEQ ID NO. 169, 
SEQ ID NO. 171, SEQ ID NO. 173. SEQ ID NO. 175, SEQ ID NO. 177, SEQ ID NO. 
179, SEQ ID NO. 181, SEQ ID NO. 183, SEQ ID NO. 185, SEQ ID NO. 187, SEQ 
ID NO. 189, SEQ ID NO. 191, SEQ ID NO. 193, SEQ ID NO. 195. SEQ ID NO. 197. 
SEQ ID NO. 199, SEQ ID NO. 201 , SEQ ID NO. 203, SEQ ID NO. 205, SEQ ID NO. 

35 207, SEQ ID NO. 208, SEQ ID NO. 209, SEQ ID NO. 210, SEQ ID NO. 211, SEQ 
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ID NO. 212, SEQ ID NO. 213, SEQ ID NO. 214, SEQ ID NO. 215, SEQ ID NO. 216, 
SEQ ID NO. 217, SEQ ID NO. 218, SEQ ID NO. 219, SEQ ID NO. 220, SEQ ID NO. 
221, SEQ ID NO. 222, SEQ ID NO. 224, SEQ ID NO. 226, SEQ ID NO. 228, SEQ 
ID NO. 230, SEQ ID NO. 232, SEQ ID NO. 234, SEQ ID NO. 236, SEQ ID NO. 238, 
5 SEQ ID NO. 240, SEQ ID NO. 242 and SEQ ID NO. 244, SEQ ID NO. 246, SEQ ID 
NO. 248, SEQ ID NO. 250, SEQ ID NO. 252, SEQ ID NO. 254, SEQ ID NO. 256, 
SEQ ID NO. 258, SEQ ID NO. 260, SEQ ID NO. 262, SEQ ID NO. 264, SEQ ID NO. 
266, SEQ ID NO, 268, SEQ ID NO. 270, SEQ ID NO. 272, SEQ ID NO. 274, SEQ 
ID NO. 276, SEQ ID NO. 278, SEQ ID NO. 280, SEQ ID NO. 282 SEQ ID NO. 284, 

10 SEQ ID NO. 286, SEQ ID NO. 288, SEQ ID NO. 290, SEQ ID NO. 292, SEQ ID NO. 
294, SEQ ID N0.296, SEQ ID N0.298, SEQ ID NO. 300, SEQ ID NO. 302 and SEQ 
ID NO. 304. Furthermore, functional homologues of the fragments of the polypeptide 
sequences may also be comprised within cell surface molecules of the present in- 
vention, or the cell surface molecules may consist of functional homologues of the 

15 fragments of the polypeptide sequences. Functional homologues are defined herein 
below. 

Particularly preferred fragments of the cell surface molecules according to the pres- 
ent invention are fragments that comprises one or more extracellular domains of the 
20 cell surface molecules. Additionally, fragments which comprise parts of extracellular 
domains are also preferred fragments within the scope of the present invention. 
Most preferably, the fragments of the cell surface molecules according to the pres- 
ent invention are fragments that comprise one or more extracellular domains and 
which are capable of internalising a binding partner having affinity for said fragment. 

25 

In one embodiment the fragments comprise an extracellular domain or fragments 
thereof or derivatives thereof, wherein said extracellular domain may be selected 
from the group consisting of polypeptide sequences, which are encoded by the nu- 
cleotide sequences SEQ ID NO 3 nucleotide 1014 to 2450, SEQ ID NO 15 nucleo- 

30 tide 216 to 3179, SEQ ID NO. 31 nucleotide 147 to 2999, SEQ ID NO 52 nucleotide 
419 to 4120, SEQ ID NO 66 nucleotide 268 to 7059, SEQ ID NO 82 nucleotide 235 
to 2094, SEQ ID NO104 nucleotide 160 to 663, SEQ ID NO 204 nucleotide 301 to 
2250, SEQ ID NO 229 nucleotide 130 to 2880, SEQ ID NO 281 nucleotide 569 to 
2152, SEQ ID NO 283 nucleotide 585 to 1901, SEQ ID NO 291 nucleotide 121 to 

35 2836 and SEQ ID NO 293 nucleotide 259 to 2127. 
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The cell surface molecule and the fragments of cell surface molecules as outlined 
herein above, may furthermore comprise posttranslational modifications. Examples 
of posttranslational modifications are phosphorylations, glycosylation, acetylations, 
5 methylation, sulfatation, polysialylation, farnesylation, myristoylation or palmitylation. 

Functional homologues of the cell surface molecules outlined in table 2 are also 
contained within the present invention. SEQ ID NO of polypeptide sequences en- 
coding preferred cell surface molecules according to the present invention are also 
10 given in table 2. Functional homologues of cell surface molecules according to the 
present invention are cell surface molecules which can associate with the binding 
partners according to the present invention and which preferably can internalise said 
binding partners. 

15 Promoters 

Promoters within the scope of the present invention are first nucleic acid sequences, 
which are capable of directing expression of second nucleic acid sequences 
operably linked thereto. Such first nucleic acid sequences are normally found 
20 upstream on the chromosome of nucleic acid sequences that may be transcribed. 

In one embodiment, preferably a first nucleic acid sequence operably linked to a 
second nucleic acid sequence comprise at least 15,000 base pairs upstream of the 
translation start codon of said second nucleic acid sequence on the chromosome. 

25 However, the first nucleic acid sequence operably linked to a second nucleic acid 
sequence may also comprise at least 12.500, such as at least 10.000, for example 
at least 8,000, such as at least 6,000, such as at least 5,000, such as at least 4,000, 
such as at least 3,000, for example at least 2,500, such as at least 2,000, such as at 
least 1,500, such as at least 1,000, for example at least 500, such as at least 400, 

30 for example at least 300, such as at least 200, for example at least 150, such as at 
least 100, for example at least 50, such as at least 25, for example at least 10 base 
pairs upstream of the translation start codon of said second nucleic acid sequence 
on the chromosome, or a fragment of any such sequences capable of directing gene 
expression. 
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Furthermore, in another embodiment the first nucleic acid sequence operably linked 
to a second nucleic acid sequence preferably comprises up to 10, such as up to 
100, such as up to 500, for example up to 1000, such as up to 2500, for example up 
5 to 5000 base pairs upstream of the translation start codon of said second nucleic 
acid sequence on the chromosome, or a fragment thereof capable of directing gene 
expression. . 

It is also within the scope of the present invention, that the first nucleic acid 
10 sequence operably linked to a second nucleic acid sequence may comprise one or 
more intron sequences or fragments thereof found upstream of the translation start 
codon of said second nucleic acid sequence on the chromosome. Furthermore, the 
first nucleic acid sequence operably linked to a second nucleic acid sequence may 
comprise one or more intron sequences or fragments thereof found downstream of 
1 5 the translation start codon of said second nucleic acid sequence on the 
chromosome. 

The first nucleic acid sequence operably linked to a second nucleic acid sequence 
may furthermore comprise an enhancer sequence located more than 15,000 base 
20 pairs upstream or downstream from the translation start codon of said second 
nucleic acid sequence on the chromosome. 



The above mentioned first nucleic acid sequence operably linked to a second 
25 nucleic acid sequence may also comprise deletions and/or additions of nucleic 

acids. Deletions and/or additions may be internal or they may be at the end of the 
nucleic acid sequence. 

Accordingly, from a first nucleic acid sequence, which for example comprise up to 
30 1000, such as up to 2500, for example up to 5000, such as up to 7500, such as up 
to 10,000, for example up to 15,000, such as up to 20,000 base pairs upstream from 
the translation start codon of a second nucleic acid sequence on the chromosome, 
at least 10 internal bp, such as at least 25 internal bp, for example at least 50 
internal bp, such as at least 100 internal bp, for example 200 internal bp, such as at 
35 least 300 internal bp, for example at least 400 internal bp, such as at least 500 
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internal bp, for example at least 750 internal bp, such as at least 1000 internal bp, 
for example at least 1250, such as at least 1500, for example at least 1750, such as 
at least 2000, for example at least 2500, such as at least 3000, for example at least 
3500, such as at least 4000, for example at least 4500 internal base pairs may be 
5 deleted. 

Specific nucleic acid sequences may be more favourable to delete than others. For 
example sequences that suppress expression or sequences that do not alter 
expression of second nucleic acid sequences operably linked thereto may be 

10 deleted. Accordingly, the present invention also encompass first nucleic acid 

sequence operably linked to a second nucleic acid sequence comprising up to 1000, 
such as up to 2500, for example up to 5000, such as up to 7500, such as up to 
10,000, for example up to 15,000, such as up to 20,000 base pairs upstream from 
the translation start codon of a second nucleic acid sequence on the chromosome, 

15 from which at least one silencer has been deleted. 

It is also possible that the first nucleic acid sequence comprises more than one 
deletion, such as 2, for example 3, such as 4, for example 5, such as more than 5 
deletions. 

20 

Furthermore, from a first nucleic acid sequence, which for example comprise up to 
20,000, for example up to 15,000, such as up to 10,000, for example up to 7500, for 
example up to 5000, such as up to 2500, for example up to 1000 base pairs 
upstream from the translation start codon of a second nucleic acid sequence on the 

25 chromosome, at least 10, such as at least 25, for example at least 50, such as at 
least 100, for example 200, such as at least 300, for example at least 400, such as 
at least 500, for example at least 750, such as at least 1000, for example at least 
1250, such as at least 1500, for example at least 1750, such as at least 2000, for 
example at least 2500, such as at least 3000, for example at least 3500, such as at 

30 least 4000, for example at least 4500 base pairs may be deleted from either one or 
the other end. 

Additions of nucleic acid sequences may be done at the end or internally. First 
nucleic acid sequences may comprise more than one addition, for example 2, such 
35 as 3, for example 4, such as 5, for example more than 5 additions. It may be 
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addition of at least 10, such as at least 25, for example at least 50, such as at least 
100, for example 200, such as at least 300, for example at least 400, such as at 
least 500, for example at least 750, such as at least 1000, for example at least 1250, 
such as at least 1 500, for example at least 1750, such as at least 2000, for example 
5 at least 2500, such as at least 3000, for example at least 3500, such as at least 
4000, for example at least 4500, such as more than 4500 base pairs. 

For example it is possible to add nucleic acid sequences that alter the function of the 
first nucleic acid sequence. For example nucleic acid sequences which are 
10 recognised by specific transcription factors may be added. For example nucleic acid 
sequences that are recognised by nuclear steroid hormone receptors 

Examples of preferred first nucleic acid sequences are given in table 3 and 4. 
Functional homologues as described herein below of these nucleic acid sequences 
15 as well as deletion and/or addition mutants as described herein above are also 
comprised with in the present invention. 

Table 3 First nucleic acid sequences indicating accession no. in the Blast da- 
tabase Version: May 10, 2001 

20 (http://www.ncbi. nlm.nih.qov/qenome/seq/paqe.cqi?F=HsBlast.html&&ORG=Hs) 



No 


Acc. No. 


Chromosome 


Region 


Bases 


pro4 


J05614 


20 


NT 011387.3 


4.073.925-4.125.777 


pro12 


AF059531 


11 


NT 009107.3 
+ 


16.582.000-16.639.000 


pro17 


AA913812 


1 


NT QQ470^.3 
+ 


145.236.000-145.282.000 


pro18 


W25866 


1 


NT 004705.3 
+ 


145.240.856-145.281.103 


. pro28 


W74442 


15 


NT 010356.3 


87.186.215-87.226.899 


pro37 


U63743 


1 


NT 004B08.3 


73.601.528-73.642.295 
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p(04y 


AAz0o476 


4 


NT 006291.3 
+ 


46.964.650-47.005.368 


8 


NT 008166.3 


CC QIC A CO CC QCC TOO 


pro53 


M15205 


1T 


NT 024874.3 
? 


78.209.000-78.266.000 


pro55 


Z95118 


6 


NT_007577 




pro62 


ALOoUoUo 


X 


NT 011553 




11 


NT 025842 




pro73 


U18271 


12 


NT 009681.3 
+ 


103.894.000-103.940.000 


pro74 


U03735 


X 


NT 011534.1 
+ 


138.253.000-138.770.000 


pro76 


U89387 


2 


NT 005409.3 


130.026.000-130.083.000 


pro77 


U 10689 


X 


NT 011726.3 
? 


137.038.000-137.108.000 


pro83 


AI553745 


5 


NT 023270.3 


177.081 .000-177.128.000 


pro86 


L18877 


x 


NT 0111534,1 
+ 


138.233.000-138.771.000 


pro87 


U10691 


x 


NT 011534.1 


138.233.000-138.770.000 


pro89 


L 18920 


x 


NT 011534.1 
+ 


138.233.000-138.770.000 


pro91 


N23137 


12 


NT 009464.3 


132.284.168-132.324.678 


pro93 


W28479 


13 


NT 024495.3 
+ 


87.871.068-87.891.259 


pro97 


r\r\JOOO\JD 


6 


NT 007592.3 


37.167.152-37.207.246 


12 


NT 009782.3 

+ 


53.429.369-53.469.399 


pro103 


U77949 


17 


NT 024901.3 


40.905.000-40.968.000 


pro112 


AL021546 


12 


NT 009775 
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piu \C\J 


M31315 


5 
*j 


NT 023188.3 
? 


176.642.000-176.692.000 


pro \C I 






NT 004680.3 
♦ 


214.431.387-214.471.483 


pro122 


AL021366 


6 


NT 007592.3 
+ 


31.000.000-38.000.000 


pro123 


. N58115 


2 


NT 005343.3 


184.162.000-184.223.000 


pro129 


AL031427 


1 


NT 004424 




pro139 


Y13115 


8 


NT 008.176.3 


39.029.000-39.073.000 


pro142 


AA151922 


17 


NT 010711.3 


73.365.739-73.406.421 


pro153 


U07563 


9 


NT 008338 




\JI\J 1 wTf 


W9R7R9 




NT 019273.3 


115.514.000-115.557.000 


JJIU 1 <JU 


/A I JJUJOt 


17 


NT 010641.3 


77.160.000-77.208.000 


pro161 


M21259 


1 


NT 004662.3 
+ 


175.106.661-175.147.391 






17 


NT 024874 




pro166 


AA926957 


<; 

■ 


NT 016864.3 


52.120.104-52.140.918 


17 


NT 01Q992.3 
+ 


1.029.000-1.077.000 


nm171 
pro 1 f 1 




1^ 


NT 024495.3 
+ 


89.323.836-89.365.575 


pro i f t 




10 


NT 010934.3 


35.186.000-35.241.000 


pro176 


AA181196 


13 


NT 009799.3 
+ 


49.703.000-49.752.000 


pro178 


AB002359 


15 


NT 010364.3 


19.619.949-19.660.316 


pro183 


U13695 


2 


NT 022*197.3 


197.599.000-197.820.000 
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pro184 


U65011 


22 


NT 011520,5 
+ 


18.956.000-19.011.000 


pro194 


AF091754 


1 


NT 004734.3 


233.039.886-233.081.072 


pro198 


AL049842 


13 


NT 009910 




pro202 


AA810792 


12 


NT 9994^4.? 
+ 


132.288.241-132.328.844 


pro210 


L37747 


5 


NT 023326.1 


122.981,845-123.022.988 


pro212 


Z36714 


2 


NT 005370.3 


212.774.647-212.814.782 


pro216 


X06745 


X 


NT 011821.1 

o 

r 


21.110.482-21.151.898 


pro225 


T75292 


4 


NT 006088.3 
+ 


82.210.258-82.250.894 


pro228 


U38979 


1 


NT 021930.3 


97.904.873-97.945.550 


pro229 


AL031778 


6 


NT 016968 




pro230 


AA044787 


5 


NT 006964.3 


154.054.000-154.122.000 


pro232 


M77481 


X 


NT 011534.1 

+ 


138.236.000-138.766.000 


pro234 


AL021397 


1 


NT 004668 




pro239 


AA595596 


1 


NT 004668 




pro240 


W25874 


4 


NT 006088.3 
+ 


82.210.000-82.256.000 


pro241 


AC004774 


7 


NT 007816 




Pro245 


AF006010 


6 


NT 007978.3 


104.966.000-105.280.000 



WO 03/000928 



PCT/IB02/03534 
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AAf ooUOU 


A 

I 


NT 004662.3 
+ 


175.107.000-175.116.000 


pro260 


AF094481 


3 


NT 005791.3 


84.866.000-84.938.000 


pro268 


L16991 


5 


NT 023126.3 


20.814.448-20.854.499 


pro273 


AC004770 


11 


NT 004686: 11a12 


64224936-64225035 


pro280 


A lOQKQfi/f 




NT 011514.4 


43.166.000-43.211.000 


pro284 


A AQORQ^Q 


0 


NT 006687.3 


48.821.840-48.862.512 


pro292 


U^OOOD 


A 
4 


NT 006109.2 


: 89.638.000-89.682.000 


pro299 


AIHQ70CQ 
AlUOfZOO 


0 


NT 007193.3 
+ 


38.723.000-38.783.000 


pro302 


A Cfl'l AfilT 
ArU14oo/ 


1/1 


NT 019583.3 

+ 


1 8.376.000-1 8.432.000 


pro 303 


AID/ UD/ £ 


OO 


NT 011520.5 
+ 


33.485.000-33.531 .000 


pro304 


L17131 


£. 


NT 005428.3 


76.252.000-76.295.000 


pro 306 


Al / 4UD^C 


0 


NT 007234.3 


1 28.840 . 581 -1 28.881 .528 


pro328 


M15796 


v 

A • 


NT 011568.3 


33 655 000-33 706 000 


on 


NT 011387.3 


4.068.000-4.115.000 


pro329 


uuuoyb 




NT 011005.3 
+ 


849 000 000-872 000 000 


pro331 


W27939 


12 


NT 009785.3 
? 


49.701.000-49.745.000 


oro338 


AA768912 


10 


NT 008609.3 


26.001.705-26.042.334 


pro341 


AA877215 


1 


NT 019273.3 


114.182.965-114.223.575 
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63 







13 


NT 009952.3 
+ 


98.753.071-98.793.770 


pro344 


X16277 


2 


NT 005380.3 
+ 


10.809.000-10.860.000 


pro347 


AI032612 


1 


NT 019269.3 


41.189.000-41.265.000 


pro348 


AI032612. 


12 


NT 024394.3 


101.992.000-102.107.000 


pro352 


AI525633 


12 


NT 009785.3 
? 


49.699.479-49.740.007 


pro358 


N95406 


7 


NT 007816.3 
? 


96.997.000-97.064.000 


pro361 


AA255502 


6 


NT 007592.3 
+ 


28.000.000-32.000.000 



Table 4 First nucleic acid sequences indicating accession no. in the human genome 
browser, 12 Dec 2000 draft assembly of the human genome Human genome 
5 browser 



No 


Acc. No. 


Chromosome 


Band 


Bases 


prol 


M87338 


7 


7q11.22 


67380990-67396543 


pro2 


U73379 


20 


20q13.11 


46319851-46338917 j 


pro3 


X05360 


10 


10q21.2 


64194062-64211044 


pro5 


M25753 


5 


5q23.2 


137604120-137622654 j 


5 


5q22.2 


121662724-121679510 \ 


5 


5q13.1 


73649568-73666303 


pro6 


M74558 


1 


1p33 


52577823-52592535 


pro7 


D38073 


6 


6p12.2 


56968405-56986831 


pro8 


AF015254 


17 


17p12 


9089031-9106502 


pro9 


J03626 


3 


3q21.2 


135784683-135799798 


pro10 


U74612 


12 


12p13.33 


3074066-3089514 


, pro13 


D63880 


12 


12p13.31 


6903002-6917288 


pro14 


D14657 


15 


15q22.2 


60215632-60231472 . 


pro15 


AB024704 


20 


20q11.21 


31872458-31887490 



WO 03/000928 
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64 



pro16 


AI375913 
(J04088) 


17 


17q21.2 


42673095-42689004 


pro19 


U37426 


10 


10q23.33 


99600001-99623479 


pro20 


AF098162 


12 


12q13.3 


58470993-58506578 


pro21 


X74330 


12 


12q13.3 


58801599-58817157 


pro22 


L47276 


17 


17q21.2 


42703746-4271 9695 


pro23 


L25876 


14 


14q22.2 


50958791 -50974988 


pro24 


U65410 


4 


4q26 


126153161-126168416 


pro25 


X13293 


20 


20q12 


44147604-44164046 


pro26 


X51688 


4 


4q26 


128042847-128058218 


pro27 


AL080146 


15 


15q22.1 


55104198-55120101 


pro29 


D88357 


10 


10q21.2 


64199199-64214445 


pro30 


D26361 


1 


1q32.1 


2240354 1 4-224051 265 


pro31 


D14678 


6 


6p21.31 


36374334-36389597 


pro32 


AF011468 


20 


20q13.2 


36644429-36659811 


58763968-58779170 


pro34 


AB019987 


3 


3q25.33 


175620714-175635845 


pro36 


AF053306 


15 


15q15.1 


35625667-35641196 


pro38 


AF032862 


5 


5q34 


178469476-178486752 


pro39 


U01038 


16 


16p21.1 


28930379-28945875 


pro40 


D13633 


14 


14q22.3 


29043898-29059035 


51960394-51975921 


pro41 


AJ000186 


4 


4q26 


126153146-126168525 


pro42 


D26018 


11 


11q13.4 


79445361-79460511 


prt>43 


X02308 


18 


18p11.32 


931580-947280 I 


pro44 


AF016371 


1 


1p34.1 


47366616-47382271 


pro45 


U05340 


1 ' 


1p34.1 


48138801-48153800 


pro46 


D80008 


20 


20p11.21 


27258467-27274295 


pro47 


AB000449 


14 


14q32.2 


96123633-96139456 


pro48 


AF017790 


18 


18p11.32 


2615001-2630207 


pro50 


X51688 


4 


4q26 


128041767-128057590 


pro51 


AF081280 


10 


10q24.32 


109292185-109308954 


pro52 


M86699 


6 


6q14.1 


85882001-85897200 


pro54 


X74794 


6 


8q 11.22 


50261007-50276309 


pro56 


AB006624 


12 


12q13.3 


59174501-59189500 


proof 




1 I 


1 7q21 .2 


45056497*45072182 


pro58 


AB018334 


5 


5p13.2 


40319301-40338000 


pro59 


AF091433 


8 


8q22.1 


98382913-98396920 


pro60 


229066 


1 


1q32.3 


237405001-237420000 


pro61 


D79988 


12 


12q24.31 


.133269001-133284000 


133230501-133245500 



WO 03/000928 



PCT/IB02/03534 



65 



pro63 


AL050151 


8 


8q24.3 


1 4oyoUt>o4- 1 «*oyo^uoD 


pro64 


D79997 


9 


9p13.2 


39124o6U-3914oUUU j 


pro65 


L07541 


13 


13q13.1 


3241 5UU1 - J^4 JoUUU 


pro66 


U37139 


1 


1p31.3 


71792205-71807237 


pro67 


Y00272 


10 


10q21.2 


641 94599-6421 1207 


pro68 


D90070 


18 


■4 o o 4 no 

. 18q21.32 


6206 1 499-62077998 


pro69 


U14518 


2 


2p23.3 


26821 351 -26836883 


pro70 


X87843 


14 


14q23.1 


57525402-57541401 


pro71 


U26727 


9 


9p21.3 


23824001-23840000 


pro72 


X52142 


1 


1q42.12 


251 569001-251 584900 


pro75 


L48692 


2 


2p13.3 


6951 2098-69527598 


pro78 


L31801 


1 


1p13.2 


124826201-124843002 


1 24490069-124506743 


pro79 


U15552 


1 


1p22.1 


1 0501 9659- 1 05034740 


pro80 


AF039652 


17 


17p1 1.2 


20094899-201 10488 


pro81 


U79266 


11 


11q13.2 


67334001 -67349000 


pro82 


X92106 


17 


17q11.2 


31700001-31715000 


pro 84 


L07540 


12 


12q24.23 


1 26266001 -1 26283000 


pro85 


U 766 38 


2 


2q35 


219994976-220010675 


pro 88 


L34600 


2 


2p16.1 . 


56591 445-5660691 6 


pro90 


D83781 


11 


11p11.2 


48298842-48316506 


pro92 


X59618 


X 


Xp11.3 


41625001-41640000 


pro 94 


U73704 


1 


1p22.1 


1 02963001 -1 02978500 


pro95 


D50923 


1 


1q42.13 


256927030-2569421 17 


pro96 


M85085 


X 


Xq22.1 


100331001-100346000 


pro98 


AF025840 


14 


14q21.3 


46340081-46355371 


pro99 


AF029670 


17 


17q23.2 


63449001-63464000 


pro100 


AF070559 


13 


13q33.1 


106141001-106156500 


pro101 


S 78085 


6 


6q27 


1 80902275-1 8091 7523 


pro104 


U39817 


15 


15q26.1 


87983001-87998000 


pro105 


L23959 


13 


13q34 


116658001-116673000 


pro 106 


X77743 


2 


2q24.1 


160044918-160062017 


pro107 


U 10564 


11 


11p15.3 


9127192-9142201 


pro108 


AB028069 


7 


7q21.12 


88435001-88450000 


pro109 


AF053977 


5 


5q31.1 


150402501-150422204 


prol 10 


uuyuo/ 


id, 


1 2q23.1 


1 04568858-1 0458522 1 


pro111 


AF073362 


11 


. 11q21 


101407914-101423315 


pro113 


L49054 


3 


3q25.32 


173904528-173934748 


pro114 


X04327 


7 


7q33 


139307570-139325937 


pro115 


AF074723 


14 


14q24.2 


68654211-68669417 


pro116 


AF091092 


1 


1p32.3 


58583305-58598434 



WO 03/000928 



PCT/1B02/03534 



66 



pro117 


AF058918 


19 


19q 13.43 


70037301-70052300 


pro118 


AF025441 


15 


15q15.1 


36875108-36890661 


pro119 


AJ006591 


4 


4q31.1 


148771298-148786657 | 


pro124 


L36529 


I 18 


18p11.32 


254219-270019 


pro125 


X65550 


10 


10q26.2 


138240294-138255077 


pro126 


M34065 


5 


5q31 .2 


150756331-150771379 


pro127 


L02547 


20 


20q13.2 • 


58756755-58772000 


pro128 


U00238 


4 


4q12 


59266501-59283500 


pro130 


L42450 


2 


2q31.1 


176199115-176214588 


pro131 


X98253 


X 


Xq24 


122031634-122047863 


pro133 


U58970 


20 


20q12 


45447501-45463000 


pro134 


AF077953 


I 18 


18q21.1 


47833501-47849000 


pro135 


J00140 


5 


5q14.1 


86483185-86498573 


I 18 


18q11.2 


25676581-25692180 


pro136 


M74093 


19 


19q12 


34192227-34207465 


pro137 


AF029669 


17 


17q23.1 


63211106-63252789 


pro138 


AL050019 


1 


1p36.33 


52302-67336 


pro140 


M68520 


12 


12q13.2 


57983277-57998212 


pro141 


Y13467 


17 


17q12 


41583543^1598142 


pro143 


U64805 


17 


17q21.31 


45619001-45639000 


pro144 


U78082 


14 


14q24.2 


68653184-68668183 


pro145 


Y15164 


X 


Xp22.22 


13061501-13076500 


pro146 


AF008442 


6 


6p21.1 


47871392-47886807 


pro147 


X16901 


13 


13q14.13 


43972520-43990027 


pro148 


U06632 


17 


17q23.1 


61863000-61878839 


pro149 


D26069 


3 


3q29 


215253329-215270320 


pro150 


AF027150 


14 


14q21.1 


35344279-35359894 


pro152 


D13413 


3 


3q29 


211580067-211591811 


pro155 


L20320 


5 


5q13.1 


73642669-73657817 


pro157 


AJ223728 


22 


22q11.21 


16393966-16409005 


pro158 


AB023215 


14 


14q24.3 


74118287-74135032 


pro159 


D32002 


9 


9q22.33 


98867915-98884079 


pro160 


Y18046 


6 


6q27 


177044740-177060727 


pro 162 


U27459 


2 


2q33.2 


206003518-206019965 


pro164 


D86322 


4 


4q31.1 


147998675-148013849 


pro 165 


Z46376 


2 


2p12 


75788712-75806284 


pro167 


AF003540 


11 


11p15.5 


2576355-2591604 


pro168 


D38553 


2 


2q11.1 


94745291-94760990 


pro169 


M60725 


17 


17q23.2 


64866487-64881241 


pro173 


L34673 


3 


3q24 


162499055-162515000 


pro174 


U93867 


1 


1q21.1 


162966980-162983139 



WO 03/000928 



PCT/IB02/03534 



67 



pro175 


U44754 


14 


14q23.1 


58567756-58582473 


pro177 


AL050405 


X 


Xq26.1 


1 3244407 1 - 1 324 59420 


pro179 


U 18937 


5 


5q31.3 


1 53395323-1 5341 1 1 37 


pro180 


X76388 


4 


4q31.21 


1 52639006-1 52654005 


pro181 


AF038662 


3 


3q13.32 


1 293947 1 6-1 2940991 8 


pro182 


AF042169 


10 


10q22.1 


74034210-74049959 


pro185 


U07804 


20 


20q12 


41507121-41523513 


pro187 


M97388 


1 


1p22.1 


104042438-104058335 


pro189 


L35546 


1 


1p22.1 


104986774-105006727 


pro191 


AB007962 


1 


1q21.3 


170103257-170119230 


pro193 


AF006259 


12 


12p13.22 


4781441-4796771 


pro195 


X78627 


2 


2q14.3 


1 23494635-1 2351 1 228 


pro196 


M62810 


10 


10q21,1 


61677452-61693234 


pro197 


AL080116 


6 


6q15 


94998657-95013656 


pro199 


U79256 


10 


10q24.1 


104501013-104516982 


pro201 


AL080088 


16 


16p13.2 


11860550-11875029 


pro203 


M27878 


12 


12q24.33 


144471103-144484312 


pro204 


AF091090 


1 


1p22,2 


99597586-99612101 


pro205 


AF067656 


10 


10q21.1 


59448924-59463923 


pro206 


U03911 


2 


2p16.3 


48415852-48431851 


pro207 


U61145 


7 


7q36.1 


155082360-155098279 


pro208 


D78335 


1 


1q23.3 


187227102-187243552 


pro209 


U30872 


1 


1q41 


240503771-240519770 


pro211 


U97188 


7 


7p15.3 


23230613-23248257 


6 


6q27 


176760451-176776177 


pro213 


D38550 


6 


6p22.3 


22232782-22247803 


pro215 


X15331 


X 


Xq22.3 


108009046-108024961 


pro217 


D87448 


3 


3q22.1 


146018136-146033999 


pro219 


U31556 


8 


8q21.3 


88467142-88482808 


pro220 


X66113 


.1 


1p36.22 


11562849-11578096 


pro221 


M93119 


20 


20p11.23 


21882614-21898000 


pro222 


AB020670 


18 


18q23 


83636442-83651891 


pro223 


AF000430 


12 


12p11.21 


33646020-33661381 


pro224 


U73960 


7 


7p21.3 


11968001-11983000 


pro226 


X76029 


4 


4q12 


58426181-58442374 


pro227 


AF063020 


9 


9p22.3 


16919890-16934979 


pro231 


AJ 132440 


1 


1q32.1 


226438005-226453537 


pro233 


AB024401 


13 


13q34 


113595323-113611692 


pro235 


AL049266 


2 


2q32.1 


190196543-190212071 


pro236 


AB014550 


18 


18p11.32 


2784040-2799481 


. pro237 


X78932 


7 


7q11.21 


64013301-64028356 



WO 03/000928 



PCT/TB02/03534 



68 



pro238 


U22377 


1 


1p34.2 


446839*0-44 6994*9 


pro242 


L 16782 


10 


10q23.31 


96 5331 96-96 548690 


pro243 


AJ001810 


16 


16q12.2 


63707495-63722576 


pro244 


U 16028 


2 


2q31.1 


A TQOOAil A •» 4 700/1 Cft 4 O 

1 78830447-178845919 


pro246 


L08424 


12 


12q23.2 


1 09949544-1 09964900 


pro247 


U07559 


5 


5q11.2 


53964473-53980257 


pro248 


Y10043 


X 


Xq28 


ACCAAATAf\ A CCA O r\r\£\ A 

1 5514471 9-1 551 60264 


pro249 


S74445 


15 


1 5q24.2 


74840^31-74855495 


pro250 


U25165 


3 


3q26.33 


1 96934501 -1 96950372 


pro251 


U63336 


6 


6p21.33 


3366201 9-33677856 


pro253 


U 13022 


7 


7q35 


1 49686 107-1 497023 3 1 


pro255 


AB0 19494 


6 


5p13.2 


39907204-39922203 


pro256 


L07919 


2 


u 2q31.1 


175756788-175772636 


pro257 


AB029006 


2 


2p22.3 


32520 1 44-32540 1 69 


pro258 


AB028995 


17 


17q23.2 


63475124-63497063 


pro259 


U62325 


2 


2q35 


223126524-223141423 


pro262 


X54942 


9 


9q22.1 


8789900 1 -879 1 5976 


pro263 


U 10860 


4 


4p16.1 


9058241-9073583 


pro264 


M87339 


3 ' 


3q27.3 


204899461-20491 51 88 


pro267 


X59543 


11 . 


11p15.5 


4 /\4l A A rt rt r* rt St /-V 

3340168-3358000 


pro269 


M76180 


7 


7p12.2 


51709937-51727010 


pro270 


M92299 


17 


17q21.32 


52711008-52726713 


pro271 


AB028021 


20 


20p11.21 


2536301 7-25378980 


pro272 


Y16752 


6 


6p22.2 


27692102-27707775 


pro275 


i mo 4 O A 

U96131 


5 


5p15.33 


4 4rt<»«/>4 4 4 4r»/\*»/N 

1133261-1148076 


pro277 


X84194 


14 


14q24.3 


73530456-73545798 


pro278 


D82345 


X 


Xq22.1 


1 02209954-1 0222761 0 


pro279 


L3681 8 


1 1 


11q13.4 


^ rt /N f~ A 41 A ^A A i^Arr 

76825241-76843255 


pro282 


V00568 


6 


8q24.13 


4i rt rt r~ rt **▼ -art 4t rt rt rt ^ A ^ A rt 

1 3095271 6-1 30971 51 0 


pro283 


U87459 


X 


Xq28 


1 57900553-1 5791 7852 


pro285 


X 16396 


2 


2p12 


T J AAA"7AA ^ 4ti^\^r\^^r> 

74933738-74953722 


pro286 


A C~f\f\^ A A f\ 

AF007140 


19 


19p13.2 


1 0684785-1 0702574 


pro287 


A ^™ rt f\ S\ A A 

AF053641 


20 


20q13.13 


50999953-51020590 


pro289 


X55110 


11 


11p11.2 


47010261-47026357 


pro290 


U84573 


3 


3q24 


1 59499984- 1 59525599 


Y 
A 


Aqzo.£ 


40"7CCO'trtO -1 07C0 4 rtft"7 

1o7doo»Uo-1 37581997 


pro291 


M97856 


1 


1p34.1 


50925640-50946744 


pro293 


AB011173 


1 


1p36.12 


25196653-25221848 


pro294 


U34994 


3 


3p24.3 - 


25352447-25367957 


pro295 


Y18004. 


X. 


Xp22.13 


18909706-18961865 


pro296 


D78611 


7 


7q32.2 


133415266-133433462 



WO 03/000928 



PCT/IB02/03534 



69 



pro297 


D55716 


7 




1 U 1 000O4#£- 1 U 1 0 f 4044 


pro298 


L19183 


17 


4 *7«4 4 n 

17q11.Z 


2984 1 430-29oo244y 


pro300 


X00737 


14 


14qll.z 


1 6658977-1 oo7oo 1 1 


pro301 


X 14850 
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Preferably, the first nucleic acid sequences are selected from the group consisting of 
prol, pro2 t pro3, pro4, pro5 f pro6, pro7, pro8 t pro9, pro10, pro12, pro13 t pro14 f 
5 pro15, pro16, pro17, pro18, pro19, pro20, pro21, pro22, pro23, pro24, pro25, pro26, 
pro27, pro28, pro29, pro30, pro31 , pro32, pro34, pro36, pro37, pro38, pro39, pro40, 
pro41, pro42, pro43, pro44, pro45, pro46, pro47, pro48, pro49 t pro50, pro51, pro52, 
pro53, pro54, pro55, pro56, pro57, pro58, pro59, pro60, pro61, pro62, pro63, pro64, 
pro65, pro66, pro67, pro68, pro69, pro70, pro71 , pro72, pro73, pro74, pro75 f pro76, 

10 pro77, pro78, pro79, pro80, pro81, pro82, pro83, pro84, pro85, pro86, pro87, pro88, 
pro89, pro90 p pro91, pro92, pro93, pro94, pro95, pro96, pro97 t pro98, pro99, 
pro100, pro101, pro103, pro104, pro105, pro106, pro107, pro108, pro109, pro110, 
pro111, pro112, pro113, pro114, pro115, pro116, pro117, pro118, pro119, pro120, 
pro121, pro122, pro123, pro124, pro125, pro126, pro127, pro128, pro129, pro130, 

15 pro131, pro133 t pro134, pro135, pro136, pro137, pro138, pro139, pro140, pro141, 
proi42, pro143, pro144 t pro145, pro146, pro147, pro148, pro149, pro150, pro152, 
pro153 f pro154, pro155, pro156 f pro157, pro158, pro159, pro160, pro161, pro162, 
pro163, pro164, pro165, pro166, pro167, pro168, pro169 f pro171, pro172, pro173, 
pro174, pro175, pro176 p pro177, pro178, pro179, pro180, pro181, pro182 t pro183, 

20 pro184 f pro185, pro187 ? pro189, pro191, pro193, pro194, pro195, pro196 f pro197, 
pro198, pro199, pro201, pro202, pro203, pro204, pro205, pro206, pro207, pro208, 
pro209, pro210, pro211, pro212, pro213, pro215, pro216, pro217, pro219, pro220, 
pro221, pro222, pro223, pro224, pro225, pro226, pro227, pro228, pro229, pro230, 
pro231, pro232, pro233, pro234, pro235, pro236, pro237, pro238, pro239, pro240, 

25 pro241, pro242, pro243, pro244, pro245, pro246, pro247, pro248, pro249, pro250, 
pro251 ', pro253, pro254, pro255 t pro256, pro257, pro258, pro259, pro260, pro262, 
pro263, pro264, pro267, pro268, pro269, pro270, pro271 t pro272 ( pro273 t pro275, 
pro277, pro278, pro279, pro280, pro282, pro283, pro284, pro285, pro286, pro287, 
pro289, pro290, pro291, pro292, pro293, pro294, pro295, pro296, pro297, pro298, 

30 pro299, pro300, pro301, pro302, pro303, pro304 ( pro305, pro306 t pro307, pro308, 
pro309, pro310, pro311, pro312, pro313, pro315, pro316, pro317, pro318, pro319, 
pro320, pro321, pro322, pro323, pro324, pro326, pro327, pro328, pro329, pro330, 
pro331, pro332 t pro333, pro334, pro335, pro336, pro337, pro338, pro339, pro340, 
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pro341, pro344, pro346, pro347, pro348, pro349, pro352, pro353, pro354, pro355, 
pro356 t pro358, pro359 and pro361. 

More preferably, the first nucleic acid sequence is selected from the group consist- 
5 ing of prol , pro2, pro3, pro4, pro5, pro6, pro7, pro8, pro9, pro10, pro12, pro13, 

pro14, pro15, pro16, pro17, pro18, pro19, pro21, pro22, pro23, pro24, pro25, pro26, 
pro27, pro28, pro29, pro30, pro31 ? pro32, pro34, pro36, pro37, pro38, pro39, pro40, 
pro41, pr642,' pro43, pro44, pro45, pro46, pro47 f pro48, pro49, pro50, pro51, pro52, 
pro53 p pro54, pro56 f pro58, pro59, pro62, pro64, pro65, pro66, pro68, pro69, pro70, 
10 pro71, pro72, pro73, pro74, pro75, pro77, pro78, pro81, pro82, pro85, pro86, pro87, 
pro89, pro90, prb92, pro98, pro103, pro105, pro108, pro120, pro125, pro128, 
pro130, pro133, pro135, pro136, pro137, pro157, pro184, pro205, pro206, pro207, 
pro209, pro210, pro211 f pro212, pro216, pro217, pro219, pro221, pro227, pro231, 
pro233, pro241, pro246, pro248, pro249, pro253 and pro256. 

15 

Yet more preferably, the first nucleic acid sequences are selected from the group 
consisting of prol, pro2, pro3, pro4, pro5, pro7, pro8, pro14, pro15, pro16, pro22, 
pro23, pro24, pro26, pro27, pro29, pro34, pro37, pro38, pro39, pro40, pro45, pro46, 
pr648, pro49, pro50, pro52, pro59, pro69, pro71, pro74, pro77, pro86, pro87, pro89, 
20 pro184, pro205, pro206, pro207, pro209, pro210, pro211, pro221, pro241, pro246, 
prp248 and pro256. 

Even more preferably, the first nucleic acid sequences are selected from the group 
consisting of pro2, pro4, pro8, pro14, pro15, pro16, pro22, pro49 ? pro74 f pro86, 
25 pro87, pro89, pro205, pro221, pro246, 

Most preferably, the first nucleic acid sequences are selected from the group con- 
sisting of pro221, pro210, pro71,pro41,pro30, pro2, pro209, pro14, pro4, pro8, 
pro246, pro16, pro27, pro5, pro49, pro19, pro140, pro139, pro207, pro81 ? pro273 
30 and pro362. 

In another preferred embodiment the first nucleic acid sequences are selected from 
the group consisting of pro!, pro2, pro3, pro4, pro5, pro6, pro7. pro8, pro9, pro10, 
pro12, pro13, pro14, prot5, pro16, pro17, pro18. pro19, pro20, pro21, pro22, pro23, 
35 pro24, pro25, pro26 f pro27, pro28, pro29, pro30, pro31, pro32 t pro34, pro36 t pro37, 
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pro38, pro39, pro40, pro41 , pro42, pro43, pro44, pro45, pro46, pro47 t pro48, pro49, 
pro50, pro51, pro52 t pro53, pro54, pro55, pro56, pro57, pro58, pro59 f pro60, pro61, 
pro62, pro63, pro64, pro65, pro66, pro67, pro68, pro69, pro70, pro71, pro72, pro73, 
pro74, pro75, pro76, pro77 t pro78, pro79, pro80, pro81, pro82, pro83, pro84, pro85, 
5 pro86, pro87, pro88, pro89, pro90, pro91, pro92, pro93, pro94, pro95, pro96, pro97, 
pro98, pro99, prolOQ, pro101, pro103, pro104, pro105, pro106, pro107, pro108, 
pro109, pro110, pro111, pro112, pro113, pro114, pro115, pro116, pro117, pro118, 
pro119, pro120, pro121, pro122, pro123, pro124, pro125, pro126, pro127, pro128. 
pro129 t pro130, pro131, pro133, pro134, pro135 f pro136, pro137 f pro138, pro139, 

10 pro140, pro141, pro142, pro143, pro144, pro145, pro146, pro147, pro148, pro149 f 
pro150, pro152, pro153, pro154, pro155, pro156, pro157, pro158, pro159 t pro160, 
pro161, pro162, pro163 t pro164 f pro165, pro166, pro167, pro168, pro169, pro171, 
pro172, pro173, pro174, pro175, pro176, pro177, pro178, pro179, pro180, pro181, 
pro182, pro183 t pro184, pro185, pro187, pro189, pro191, pro193, pro194, pro195, 

15 pro196, prp197, pro198 t pro199, pro201, pro202, pro203, pro204, pro205, pro206, 
pro207, pro209, pro210, pro211, pro212, pro213, pro216, pro217, pro219, pro220, 
pro221, pro222, pro223, pro224, pro225, pro227, pro228, pro229, pro230, pro231, 
pro233, pro234, pro235, pro236, pro237, pro238, pro239, pro240, pro241 , pro242, 
pro243 t pro244, pro245, pro246, pro248, pro249, pro250, pro251, pro253, pro254, 

20 pro255, pro256, pro257, pro258, pro259, pro260, pro269, pro278, pro282, pro283, 
pro284, pro285, pro297, pro315, pro326, pro327, pro328 and pro329. 

In one especially preferred embodiment of the present invention, the first nucleic 
acid sequence is pro 221 or a fragment thereof or a functional homologue thereof. 
25 Pro221 is the promoter of the gene encoding Insulinoma-associated antigen, IA-1 , 
INSM1 . Insulinoma-associated antigen mRNA is expressed at very high levels by all 
SCLC tested and only expressed at very low levels in brain and adrenal gland. Ex- 
pression has been demonstrated by for example CHIPS analysis and RT-PCR (see 
example 1 and figure 3). 

30 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro210 or a fragment thereof or a functional homologue thereof. Pro210 is 
the promoter of the gene encoding lamin B, LMNB1 . LMNB1 mRNA is expressed at 
very high levels by all tested SCLC and is only expressed at very low levels in 
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spleen, colon and lung. Expression of LMNB1 has been demonstrated by for exam- 
ple CHIPS analysis and RT-PCR (see example 1 and figure 6). 

In another preferred embodiment of the present invention the first nucleic acid se- 
5 quence is pro71 or a fragment thereof or a functional homologue thereof. Pro71 is 
the promoter of the gene encoding p16INK4/MTS1 , MTS1 , cyclin-dependent kinase 
inhibitor 2A, CDKN2A. p16INK4/MTS1 mRNA is expressed at very high levels by 
most SCLC and is only expressed at very low levels or not at all in normal tissues. 
Expression of LMNB1 has been demonstrated by for example CHIPS analysis and 
1 0 RT-PCR (see example 1 and figure 7). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro41 or a fragment thereof or a functional homologue thereof. Pro41 is 
the promoter of the gene encoding human MAD2L1 , mitotic arrest deficient, yeast 
15 homolog-like 1. MAD2L1 mRNA is expressed at very high levels in SCLC and ex- 
pressed at high levels in testes, but only at low levels in other normal tissues. Ex- 
pression of MAD2L1 has been demonstrated by for example CHIPS analysis and 
RT-PCR (see example 1 and figure 5). 

20 In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro30 or a fragment thereof or a functional homologue thereof. Pro30 is 
the promoter of the human KIAA0042 gene. KIAA0042 RNA is expressed at very 
high levels in most SCLC and in normal tissues it is only expressed at low levels in 
testes. Expression of KIAA0042 has been demonstrated by for example CHIPS 

25 analysis and RT-PCR (see example 1 and figure 4). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro2 or a fragment thereof or a functional homologue thereof. Pro2 is the 
promoter of the human gene encoding Human cyclin-selective ubiquitin carrier pro- 
30 tein, UBE2C. UBE2C mRNA is expressed at very high levels in most SCLC and in 
normal tissues it is expressed at low levels in spleen and testes. Expression of 
UBE2C has been demonstrated by for example CHIPS analysis (see example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
35 quence is pro209 or a fragment thereof or a functional homologue thereof. Pro209 is 
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the promoter of the human gene encoding mitosin, CENPF, centromere protein F. 
CENPF mRNA is expressed at high levels in most SCLC and in normal tissues it is 
expressed at low levels in testes. Expression of CENPF has been demonstrated by 
for example CHIPS analysis (see example 1). 

5 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro14 or a fragment thereof or a functional homologue thereof. Pro14 is 
the promoter of the human KIAA0101 gene. KIAA0101 RNA is expressed at high 
levels in most SCLC and in normal tissues it is expressed at low levels in 7 tissues. 
10 Expression of KIAA0101 has been demonstrated by for example CHIPS analysis 
(see example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro4 or a fragment thereof or a functional homologue thereof. Pro4 is the 
1 5 promoter of the human gene encoding Cyclin protein gene, Proliferating cell nuclear 
antigen (PCNA). PCNA mRNA is expressed at high levels in most SCLC and in 
normal tissues it is expressed at low levels in colon, spleen and testes. Expression 
of PCNA has been demonstrated by for example CHIPS analysis (see example 1). 

20 In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro8 or a fragment thereof or a functional homologue thereof. Pro8 is the 
promoter of the human gene encoding serine/threonine kinase , STK-1 , STK12, fms- 
related tyrosine kinase 3. STK-1 mRNA is expressed at high levels in SCLC and in 
normal tissues it is expressed at low levels in colon, spleen and testes. Expression 

25 of STK-1 has been demonstrated by for example CHIPS analysis (see example 1 ). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro246 or a fragment thereof or a functional homologue thereof. Pro246 is 
the promoter of the human gene encoding Achaete scute homologous protein, 
30 ASH1 , ASCL1 . ASH1 mRNA is expressed at high levels in many SCLC and in nor- 
mal tissues it is expressed at low levels in brain. Expression of ASH1 has been 
demonstrated by for example CHIPS analysis (see example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
35 quence is pro16 or a fragment thereof or a functional homologue thereof. Pro16 is 
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the promoter of the human gene encoding DNA topoisomerase II alpha (170 kD), 
TOP2A. TOP2A mRNA is expressed at high levels in SCLC and in normal tissues it 
is expressed at low levels in 9 tissues and at high levels in testes. Expression of 
TOP2A has been demonstrated by for example CHIPS analysis (see example 1). 

5 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro27 or a fragment thereof or a functional homologue thereof. Pro27 is 
the promoter of the human gene encoding Cyclin B2, CCNB2. Cyclin B2 mRNA is 
expressed at high levels in SCLC and in normal tissues it is expressed at low levels 
10 in spleen and trachea and at high levels in testes. Expression of Cyclin B2_has been 
demonstrated by for example CHIPS analysis (see example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro5 or a fragment thereof or a functional homologue thereof. Pro5 is the 
1 5 promoter of the human gene encoding Cyclin B1 , CCNB1 . Cyclin B1 mRNA is ex- 
pressed at high levels in SCLC and in normal tissues it is expressed at low levels in 
colon, spleen and trachea and at high levels in testes. Expression of Cyclin B1_has 
been demonstrated by for example CHIPS analysis (see example 1). 

20 In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro49 or a fragment thereof or a functional homologue thereof. Pro49 is 
the promoter of the human gene encoding Pituitary tumor-transforming 1, PTTG1. 
PTTG1 mRNA is expressed at high levels in SCLC and in normal tissues it is ex- 
pressed at low levels in thyroid, spleen and trachea and at high levels in testes. Ex- 

25 pression of PTTG1 has been demonstrated by for example CHIPS analysis (see 
example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro19 or a fragment thereof or a functional homologue thereof. Pro19 is 
30 the promoter of the human gene encoding Kinesin-like spindle protein HKSP, 

KNSL1. KNSLImRNA is expressed at high levels in SCLC and in normal tissues it is 
expressed at low levels in colon, small intestine and testes. Expression of KNSL1 
has been demonstrated by for example CHIPS analysis (see example 1). 
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In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro140 or a fragment thereof or a functional homologue thereof. Pro140 is 
the promoter of the human gene encoding cdc2-related protein kinase, CDK2. CDK2 
mRNA is expressed at high levels in SCLC and in normal tissues it is expressed at 
5 high levels in spleen. Expression of CDK2 has been demonstrated by for example 
CHIPS analysis (see example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro139 or a fragment thereof or a functional homologue thereof. Pro139 is 
10 the promoter of the human gene encoding Serine/threonine protein kinase SAK. 
SAK mRNA is expressed at high levels in most SCLC and in normal tissues it is 
expressed at high levels in testes. Expression of SAK has.been demonstrated by for 
example CHIPS analysis (see example 1). 

15 In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro207 or a fragment thereof or a functional homologue thereof. Pro207 is 
the promoter of the human gene encoding Enhancer of zeste homolog 2 (EZH2). 
EZH2 mRNA is expressed at high levels in SCLC and in normal tissues it is ex- 
pressed at high levels in testes. Expression of EZH2 has been demonstrated by for 

20 example CHIPS analysis (see example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro81 or a fragment thereof or a functional homologue thereof. Pro81 is 
the promoter of the human WSl/79266_gene. HSU79266RHA is expressed at high 
25 levels in most SCLC and in normal tissues it is expressed in testes and spleen. Ex- 
pression of HSU79266 has.been demonstrated by for example CHIPS analysis (see 
example 1). 

In another preferred embodiment of the present invention the first nucleic acid se- 
30 quence is pro273 or a fragment thereof or a functional homologue thereof. Pro273 is 
the promoter of the human gene encoding Rad2, flap structure-specific endonucle- 
ase 1, FEN1. Rad2 mRNA is expressed at high levels in most SCLC and in normal 
tissues it is at low levels in 12 tissues. Expression of Rad2 has.been demonstrated 
by for example CHIPS analysis (see example 1). 



35 
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In another preferred embodiment of the present invention the first nucleic acid se- 
quence is pro362 or a fragment thereof or a functional homologue thereof. Pro362 is 
the promoter of the human gene encoding Protein gene product PGP, ubiquitin car- 
boxyl-terminat esterase L1, ubiquitin thiolesterase. PGP mRNA is expressed at high 
5 levels in most SCLC and in normal tissues it is expressed at low levels in kidney and 
high levels in brain. Expression of PGP has.been demonstrated by for example 
CHIPS analysis (see example 1). 

Functional homologues 

10 

Functional homologues of polypeptides according to the present invention is meant 
to comprise any polypeptide sequence which comprise the same function. For ex- 
ample functional homologues of cell surface molecules are molecules associated 
with the cell surface which can associate with a binding partner and preferably is 
15 capable of internalising the binding partner. Functional homologues of binding part- 
ners are molecules which can associate with the cell surface molecule and which 
preferably is capable of being internalised into cells expressing the cell surface 
molecule. 

20 Functional homologues according to the present invention comprise polypeptides 
with an amino acid sequence, which are sharing at least some homology with the 
predetermined polypeptide sequences as outlined herein above. For example such 
polypeptides are at least about 40 percent, such as at least about 50 percent ho- 
mologous, for example at least about 60 percent homologous, such as at least 

25 about 70 percent homologous, for example at least about 75 percent homologous, 
such as at least about 80 percent homologous, for example at least about 85 per- 
cent homologous, such as at least about 90 percent homologous, for example at 
least 92 percent homologous, such as at least 94 percent homologous, for example 
at least 95 percent homologous, such as at least 96 percent homologous, for exam- 

30 pie at least 97 percent homologous, such as at least 98 percent homologous, for 

example at least 99 percent homologous with any of the predetermined polypeptide 
sequences as outlined herein. 
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The homology between amino acid sequences may be calculated using well known 
algorithms such as for example any one of BLOSUM 30, BLOSUM 40, BLOSUM 45, 
BLOSUM 50, BLOSUM 55, BLOSUM 60, BLOSUM 62, BLOSUM 65, BLOSUM 70, 
BLOSUM 75, BLOSUM 80, BLOSUM 85, and BLOSUM 90. 

Functional homologues may comprise an amino acid sequence that comprises at 
least one substitution of one amino acid for any other amino acid. For example such 
a substitution may be a conservative amino acid substitution or it may be a non- 
conservative substitution. 



A conservative amino acid substitution is a substitution of one amino acid within a 
predetermined group of amino acids for another amino acid within the same group, 
wherein the amino acids within a predetermined groups exhibit similar or substan- 
tially similar characteristics. Within the meaning of the term "conservative amino acid 
15 substitution" as applied herein, one amino acid may be substituted for another within 
groups of amino acids characterised by having 

i) polar side chains (Asp, Glu, Lys, Arg, His, Asn, Gin, Ser, Thr, Tyr, and Cys.) 

i 

i 

20 ii) non-polar side chains (Gly, Ala, Val, Leu, lie, Phe, Trp, Pro, and Met) 

iii) aliphatic side chains (Gly, Ala Val, Leu, He) 

iv) cyclic side chains (Phe, Tyr, Trp, His, Pro) 

25 

v) aromatic side chains (Phe, Tyr, Trp) 

vi) acidic side chains (Asp, Glu) 

30 vii) basic side chains (Lys, Arg, His) 

viii) amide side chains (Asn, Gin) 

ix) hydroxy side chains (Ser, Thr) 
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x) sulphor-containing side chains (Cys, Met), and 

xi) amino acids being monoamino-dicarboxylic acids or monoamino- 
monocarboxylic-monoamidocarboxylic acids (Asp, Glu, Asn, Gin). 

5 

Non-conservative substitutions are any other substitutions. A non-conservative sub- 
stitution leading to the formation of a functional homologue would for example i) dif- 
fer substantially in hydrophobicity, for example a hydrophobic residue (Val, lie, Leu, 
Phe or Met) substituted for a hydrophilic residue such as Arg, Lys, Trp or Asn, or a 

10 hydrophilic residue such as Thr, Ser, His, Gin, Asn, Lys, Asp, Glu or Trp substituted 
for a hydrophobic residue; and/or ii) differ substantially in its effect on polypeptide 
backbone orientation such as substitution of or for Pro or Gly by another residue; 
and/or iii) differ substantially in electric charge, for example substitution of a nega- 
tively charged residue such as Glu or Asp for a positively charged residue such as 

15 Lys, His or Arg (and vice versa); and/or iv) differ substantially in steric bulk, for ex- 
ample substitution of a bulky residue such as His, Trp, Phe or Tyr for one having a 
minor side chain, e.g. Ala, Gly or Ser (and vice versa). 

Functional homologues according to the present invention may comprise more than 
20 one such substitution, such as e.g. two amino acid substitutions, for example three 
or four amino acid substitutions, such as five or six amino acid substitutions, for ex- 
ample seven or eight amino acid substitutions, such as from 10 to 15 amino acid 
substitutions, for example from 15 to 25 amino acid substitution, such as from 25 to 
30 amino acid substitutions, for example from 30 to 40 amino acid substitution, such 
25 as from 40 to 50 amino acid substitutions, for example from 50 to 75 amino acid 
substitution, such as from 75 to 100 amino acid substitutions, for example more than 
100 amino acid substitutions. 

The addition or deletion of an amino acid may be an addition or deletion of from 2 to 
30 5 amino acids, such as from 5 to 10 amino acids, for example from 10 to 20 amino 
acids, such as from 20 to 50 amino acids. However, additions or deletions of more 
than 50 amino acids, such as additions from 50 to 200 amino acids, are also com- 
prised within the present invention. 
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The polypeptides according to the present invention, including any variants and 
functional homologues thereof, may in one embodiment comprise more than 5 
amino acid residues, such as more than 10 amino acid residues, for example more 
than 20 amino acid residues, such as more than 25 amino acid residues, for exam- 
5 pie more than 50 amino acid residues, such as more than 75 amino acid residues, 
for example more than 100 amino acid residues, such as more than 150 amino acid 
residues, for example more than 200 amino acid residues. 

Additional factors may be taken into consideration when determining functional 
10 homologues according to the meaning used herein. For example functional homo- 
logues may be capable of associating with antisera which are specific for the poly- 
peptides according to the present invention. 

In a further embodiment the present invention relates to functional equivalents which 
15 comprise substituted amino acids having hydrophilic or hydropathic indices that are 
within +/-2.S, for example within +/- 2.3, such as within +/- 2.1, for example within +/- 
2.0, such as within +/- 1.8, for example within +/- 1.6,. such as within +/- 1.5, for ex- 
ample within +/- 1 .4, such as within +/- 1 .3 for example within +/- 1 .2, such as within 
+/- 1.1, for example within +/- 1.0, such as within +/- 0.9, for example within +/- 0.8, 
20 such as within +/- 0.7, for example within +/- 0.6, such as within +/- 0.5, for example 
within +/- 0.4, such as within +/- 0.3, for example within +/- 0.25, such as within +/- 
0.2 of the value of the amino acid it has substituted. 

The importance of the hydrophilic and hydropathic amino acid indices in conferring 
25 interactive biologic function on a protein is well understood in the art (Kyte & Doolit- 
tie, 1982 and Hopp, U.S. Pat. No. 4,554,101, each incorporated herein by refer- 
ence). 

The amino acid hydropathic index values as used herein are: isoleucine (+4.5); va- 
30 line (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine 
(+1.9); alanine (+1.8); glycine (-0.4 ); threonine (-0.7 ); serine (-0.8 ); tryptophan (- 
0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); 
aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5) (Kyte & Doolittle, 
1982). 

35 
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The amino acid hydrophilicity values are: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0.+-.1); glutamate (+3.0.+-.1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5.+-.1); alanine (-0.5); histidine (-0.5); cys- 
teine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine 
5 (^2.3); phenylalanine (-2.5); tryptophan (-3.4) (U.S. 4,554,101). 

Substitution of amino acids can therefore in one embodiment be made based upon 
their hydrophobicity and hydrophilicity values and the relative similarity of the amino 
acid side-chain substituents, including charge, size, and the like. Exemplary amino 
10 acid substitutions which take various of the foregoing characteristics into considera- 
tion are well known to those of skill in the art and include: arginine and lysine; glu- 
. tamate and aspartate; serine and threonine; glutamine and asparagine; and valine, 
leucine and isoleucine. 

15 In addition to the polypeptide compounds described herein, sterically similar com- 
pounds may be formulated to mimic the key portions of the peptide structure and 
that such compounds may also be used in the same manner as the peptides of the 
invention. This may be achieved by techniques of modelling and chemical designing 
known to those of skill in the art. For example, esterification and other alkylations 

20 may be employed to modify the amino terminus of, e.g., a di-arginine peptide back- 
bone, to mimic a tetra peptide structure. It will be understood that all such sterically 
similar constructs fall within the scope of the present invention. 

Peptides with N-terminal alkylations and C-terminal esterifications are also encom- 
25 passed within the present invention. Functional equivalents also comprise glycosyl- 
ated and covalent or aggregative conjugates, including dimers or unrelated chemical 
moieties. Such functional equivalents are prepared by linkage of functionalities to 
groups which are found in fragment including at any one or both of the N- and C- 
termini, by means known in the art. 

30 

Functional equivalents may thus comprise fragments conjugated to aliphatic or acyl 
esters or amides of the carboxyl terminus, alkylamines or residues containing car- 
boxyl side chains, e.g., conjugates to alkylamines at aspartic acid residues; O-acyl 
derivatives of hydroxyl group-containing residues and N-acyl derivatives of the 
35 amino terminal amino acid or amino-group containing residues, e.g. conjugates with 
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Met-Leu-Phe. Derivatives of the acyl. groups are selected from the group of alkyl- 
moieties (including C3 to C10 normal alkyl), thereby forming alkanoyl species, and 
carbocyclic or heterocyclic compounds, thereby forming aroyl species. The reactive 
groups preferably are difunctional compounds known per se for use in cross-linking 
5 proteins to insoluble matrices through reactive side groups. 

Homologues of nucleic acid sequences within the scope of the present invention are 
nucleic acid sequences, which 

10 i) encodes an RNA and/or a protein with similar biological function; 

or 

ii) is capable of exerting a similar biological influence; 
and which is 

15 

a) at least 50% identical, such as at least 60% identical, for example at least 70% 
identical, such as at least 75% identical, for example at least 80% identical, such 
as at least 85% identical, for example at least 90% identical, such as at least 
95% identical 

20 b) or able to hybridise to the complementary strand of said nucleic acid sequence . 
under stringent conditions. 

A similar biological influence within this context may for example be that the nucleic 
acid sequence is capable influencing transcription of second nucleic acid sequences 
25 operably linked thereto in a fashion similar to functional homologous thereof. 

Stringent conditions as used herein shall denote stringency as normally applied in 
connection with Southern blotting and hybridisation as described e.g. by Southern E. 
M., 1975, J. Mol. Biol. 98:503-517. For such purposes it is routine practise to include 
30 steps of prehybridization and hybridization. Such steps are normally performed us- 
ing solutions containing 6x SSPE. 5% Denhardt's, 0.5% SDS, 50% formamide, 100 
\xglm\ denaturated salmon testis DNA (incubation for 18 hrs at 42°C), followed by 
washings with 2x SSC and 0.5% SDS (at room temperature and at 37°C), and a 
washing with 0.1x SSC and 0.5% SDS (incubation at 68°C for 30 min), as described 
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by Sambrook et at., 1989, in "Molecular .Cloning/A Laboratory Manual", Cold Spring 
Harbor), which is incorporated herein by reference. 

Homologous of nucleic acid sequences also encompass nucleic acid sequences 
5 which comprise additions and/or deletions. Such additions and/or deletions may be 
internal or at the end. Additions and/or deletions may be of 1-5 nucleotides, such as 
5 to 10 nucleotide, for example 10 to 50 nucleotides, such as 50 to 100 nucleotides, 
for example at least 100 nucleotides. 



10 Vaccine 

In one embodiment of the present invention the cell surface molecules may be used 
for the preparation of a vaccine. Preferably such a vaccine is capable of raising an 
immune response against the cell surface molecule. Such an immune response 
15 preferably results in specific killing of cells expressing said cell surface molecule. 
Most preferably, the cells expressing the cell surface molecule are malignant cells, 
such as the vaccine results in specific killing of malignant cells. 

Accordingly, the vaccine is preferably suitable for ameliorating and/or curative 
20 and/or prophylactic treatment of a premalignant and/or malignant condition. Hence, 
the vaccine preferably should be administrated to an individual suffering from a 
premalignant and/or malignant conditions, preferably cancer. The individual may be 
any animal, however preferably the individual is a human being. 

25 It is possible to use either the cell surface molecule or fragments thereof or 

derivatives thereof as well as nucleic acids encoding the cell surface molecule or 
fragments thereof or derivatives thereof. Preferred cell surface molecules to use with 
such a vaccine are cell surface molecules which are expressed at a higher level in 
malignant cells in vivo and/or malignant cell lines than in normal tissues. For 

30 example such a cell surface molecule may be identified according to the methods 
outlined herein above. However, other suitable cell surface molecules may also be 
employed. 
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Preferably, the cell surface molecule comprises or essentially consists of or for 
example is GRIA2, such as LPR8, for example is CHRNA5, such as TMEFF, for 
example is NPTXR, such as Transferrin receptor; such as type II membrane protein 
clone: for example is HP10481; such as type II membrane protein clone: such as 
5 HP1 0390; for example is PG40; such as TRC8 ; for example is TR2-1 1 ; such as 
OA3 antigenic surface determinant; for example is integrin alpha 6, For example 
GPIIb; such as vitronectin receptor alpha subunit; for example is integrin alpha-7; 
such as integrin alpha E precursor; for example is integrin alpha 6B; such as integrin 
alpha 5 subunit; for example is integrin beta-5 subunit; such as integrin alpha-3 

1 0 chain; for example is RYK; such as amyloid precursor protein-binding protein 1 ; for 
example is putative transmembrane GTPase; such as membrane cofactor protein; 
FOR EXAMPLE GLVR1 ; for example is Mr 1 10,000 antigen; for example is 
syndecan-1; such as putative seven transmembrane domain protein; for example is 
LCA-homolog/ LAR protein; such as M6 antigen; for example is Me491/CD63 

15 antigen; such as multispanning membrane protein; for example is DDR; such as 

autocrine motility factor receptor; for example is insulin receptor precursor; such as 
IGF1R, for example is insulin-like growth factor II receptor; such as SAS; for 
example is TAPA-1; such as MICB; for example is MHC class II HLA-DR7- 
associated glycoprotein beta-chain; such as HLA-DP; for example is bone small 

20 proteoglycan I biglycan; such as CAR; for example is MEA11; such as interferon- 
gamma receptor alpha chain; for example is Polymeric immunoglobulin receptor; 
such as metabotropic glutamate receptor type 4; for example is metabotropic 
glutamate receptor 8; such as CLPTM1; for example is MAGE-4b; such as 
MAGESa; for example is MAGE-3; such as MAGE-1; for example is MAGE6; such 

25 as MAGE-9; for example is MAGE1 1 ; such as CD24; for example is CD59; such as 
CD44; for example is low density lipoprotein receptor; such as very low density 
lipoprotein receptor; for example is N-CAM; such as lamin B receptor homolog 
TM7SF2; for example is putative T1/ST2 receptor binding protein precursor; such as 
NTR2 receptor; for example is RAGE-4; such as HLA-G1; for example is MOAT-C; 

30 such as alpha 2 delta calcium channel subunit isoform I; for example is LFA-3; such 
as L1-CAM; for example is AVPR2; such as C1 p1 15 C1 ; for example is TE2; such 
as RbP; for example i$ HCF1; such as IRAK; for example is CD151 ; such as surface 
antigen; for example is MAG; such as GPR19; for example is pcta-1 ; such as 
PRAME; for example is vasopressin activated calcium mobilizing receptor-like 

35 protein; such as serotonin receptor 5-HT4B; for example is serotonin 1D receptor (5- 
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HT1D-); such as CD9; for example is LDL receptor member LR3; such as DR6; for 
example is tumor necrosis factor receptor; such as HG38; for example is urokinase- 
type plasminogen receptor; such as FGF receptor; for example is nerve growth 
factor receptor; such as cystine/glutamate transporter; for example is CB1 
5 cannabinoid receptor (CNR1); such as PSG; for example is PSG13'; such as CPE- 
receptor; for example is CRH2R; such as OCI5; for example is TRAIL receptor 2; 
such as HNMP-1; for example is kidney alpha-2-adrenergic receptor; such as 
erythropoietin receptor; for example is chondroitin sulphate proteoglycan.versican 
V1; for example is mGluRlbeta; such as CD97; for example is L6; such as NY-ESO- 

10 1 ; for example is T-cell receptor alpha delta; such as roM ; for example is ror2; such 
as SSTR2; for example is VESPR; such as IgG Fc receptor; for example is 
glutamate receptor subunit GluRC; such as HEK2; for example is PVR; such as 
CEA; for example is CC-chemokine-binding receptor JAB61 ; such as HER2; for 
example is HER3; such as hypothetical protein FLJ22357 similar to Epidermal 

15 growth factor receptor-related protein; for example is putative endothelin receptor 
type B-like protein; such as GLVR2; for example is P2X4 purinoreceptor; such as 
FPRL1; for example is Atrial natriuretic peptide clearance receptor; for example is 
gastrin/CCK-B receptor; such as Neuromedin B receptor; for example is GFRA3; 
such as GRPR; for example is CDH1; such as CDH2; for example is TGFBR1; such 

20 as TGFBR2; for example is TGFBR3; such as precursor of epidermal growth factor 
receptor. 

More preferably, the cell surface molecule is selected from the group consisting of 
NCAM1, NPTXR, LRP8, CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, TNFRSF12, 
25 L1 CAM, GPR49 and TMEFF1 . 

In one preferred embodiment the vaccine furthermore comprise a non-self antigen 
covalently linked to said cell surface molecule. Alternatively, when nucleic acid 
sequences encoding cell surface molecules are used, the vaccine may comprise 
30 second nucleic acid sequences encoding a non-self antigen linked to the nucleic 
acid sequences. 

Examples of non-self antigens which may be used with the present invention are 
invention are T-cell epitopes, preferably polypeptides or peptide. 
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It is also possible that the vaccine comprises more than one antigen, such as 2, for 
example 3, such as 4, for example 5, such as more than 5 different antigens. The 
antigens may be self antigens or non-self antigens. 

5 The vaccine according to the present invention may furthermore comprise an 

adjuvant and /or a carrier. The carrier or adjuvant could be any carrier or adjuvant 
known in the art including functional equivalents thereof. Functionally equivalent 
carriers are capable of presenting the same antigen in essentially the same steric 
conformation when used under similar conditions. Functionally equivalent adjuvants 
10 are capable of providing similar increases in the efficacy of the composition when 
used under similar conditions. 

Preferably, said compositions comprise potent, nontoxic adjuvants that will enhance 
and/or modulate the immunogenicity of immunogenic determinants including anti- 
15 genie determinants including haptenic determinants represent one group of pre- 
ferred adjuvants. In addition, such adjuvants preferably also elicit an earlier, more 
potent, or more prolonged immune response. Such an adjuvant would also be useful 
in cases where an antigen supply is limited or is costly to produce. 

20 Adjuvants pertaining to the present invention may be grouped according to their 
origin, be it mineral, bacterial, plant, synthetic, or host product. The first group under 
this classification is the mineral adjuvants, such as aluminum compounds. Antigens 
precipitated with aluminum salts or antigens mixed with or adsorbed to performed 
aluminum compounds have been used extensively to augment immune responses 

25 in animals and humans. Aluminium particles have been demonstrated in regional 
lymph nodes of rabbits seven days following immunisation, and it may be that an- 
other significant function is to direct antigen to T cell containing areas in the nodes 
themselves. Adjuvant potency has been shown to correlate with intimation of the 
draining lymph nodes. While many studies have confirmed that antigens adminis- 

30 tered with aluminium salts lead to increased humoral immunity, cell mediated immu- 
nity appears to be only slightly increased, as measured by delayed-type hypersensi-. 
, tivity. Aluminium hydroxide has also been described as activating the complement 
pathway. This mechanism may play a role in the local inflammatory response as 
well as immunoglobulin production and B cell memory. Furthermore, aluminium hy- 

35 droxide can protect the antigen from rapid catabolism. Primarily because of their 
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excellent record of safety, aluminum compounds are presently the only adjuvants 
used in humans. 

Another large group of adjuvants is those of bacterial origin. Adjuvants with bacterial 
5 origins can be purified and synthesised (e.g. muramyl dipeptides, lipid A) and host 
mediators have been cloned (Interleukin 1 and 2). The last decade has brought sig- 
nificant progress in the chemical purification of several adjuvants of active compo- 
nents of bacterial origin: Bordetella pertussis, Mycobacterium tuberculosis, lipopoly- 
saccharide, Freund's Complete Adjuvant (FCA) and Freund^ Incomplete Adjuvant 

10 (Difco Laboratories, Detroit, Mich.) and Merck Adjuvant 65 (Merck and Company, 
Inc., Rahway, N.J.). Additionally suitable adjuvants in accordance with the present 
invention are e.g. Titermax Classical adjuvant (SIGMA-ALDRICH), ISCOMS, Quil A, 
ALUN, see US 58767 and 5,554,372, Lipid A derivatives, choleratoxin derivatives, 
HSP derivatives, LPS derivatives, synthetic peptide matrixes, GMDP, and other as 

15 well as combined with immunostimulants (US 5,876,735). 

B. pertussis is of interest as an adjuvant in the context of the present invention due 
to its ability to modulate cell-mediated immunity through action on T-lymphocyte 
populations. For lipopolysaccharide and Freund's Complete Adjuvant, adjuvant ac- 
20 tive moieties have been identified and synthesised which permit study of structure- 
function relationships. These are also considered for inclusion in immunogenic com- 
positions according to the present invention. 

Lipopolysaccharide and its various derivatives, including lipid A, have been found to 
25 be powerful adjuvants in combination with liposomes or other lipid emulsions. It is 
not yet certain whether derivatives with sufficiently low toxicity for general use in 
humans can be produced. Freund's Complete Adjuvant is the standard in most ex- 
perimental studies. 

30 Mineral oil may be added to vaccine formulation in order to protect the antigen from 
rapid catabolism. 



35 



Many other types of materials can be used as adjuvants in immunogenic composi- 
tions according to the present invention. They include plant products such as 
saponin, animal products such as chitin and numerous synthetic chemicals. 
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Adjuvants according to the present invention can also been categorised by their 
proposed mechanisms of action. This type of classification is necessarily somewhat 
arbitrary because most adjuvants appear to function by more than one mechanism. 
5 Adjuvants may act through antigen localisation and delivery, or by direct effects on 
cells making up the immune system, such as macrophages and lymphocytes. An- 
other mechanism by which adjuvants according to the invention enhance the im- 
mune response is by creation of ah antigen depot. This appears to contribute to the 
adjuvant activity of aluminum compounds, oil emulsions, liposomes, and synthetic 
10 polymers. The adjuvant activity of lipopolysaccharides and muramyl dipeptides ap- 
pears to be mainly mediated through activation of the macrophage, whereas B. 
pertussis affects both macrophages and lymphocytes. Further examples of adju- 
vants that may be useful when incorporated into immunogenic compositions ac- 
cording to the present invention are described in US 5,554,372. 

15 

In one preferred embodiment, adjuvants according to the present invention are se- 
lected from the group consisting of aluminium compounds, Freunds incomplete ad- 
juvant, Titermax classical adjuvant and oil emulsions. 

20 There is also provided an embodiment of the present invention wherein the immu- 
nogenic composition further comprises a carrier. The carrier may be present inde- 
pendently of an adjuvant. The purpose of conjugation and/or co-immunisation of an 
antigen and a carrier can be e.g to increase the molecular weight of the antigen in 
order to increase the activity or immunogenicity of the antigen, to confer stability to 

25 the antigen, to increase the biological activity of the determinant, or to increase its 
serum half-life. The carrier protein may be any conventional carrier including any 
protein suitable for presenting antigens. Conventional carrier proteins include, but 
are not limited to, keyhole limpet hemocyanin, serum proteins such as transferrin, 
bovine serum albumin, or human serum albumin, an ovalbumin, immunoglobulins, 

30 or hormones, such as insulin. 

While any suitable pharmaceutical carrier known to those of ordinary skill in the art 
may be employed in the vaccine of this invention, the type of pharmaceutical carrier 
will vary depending on the mode of administration and whether a sustained release 
35 administration is desired. For parenteral administration, such as subcutaneous in- 
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jection, the pharmaceutical carrier may e.g. comprise water, saline, alcohol, fat, a 
wax or a buffer. Biodegradable microspheres (e.g., polylactic galactide) may also be 
employed as pharmaceutical carriers for the pharmaceutical compositions of this 
invention. 

In one embodiment of the present invention, the vaccine involves the use of den- 
dritic cells. Such an embodiment preferably comprises the steps of 

i) providing dendritic cells; and 

ii) transferring nucleic acid sequences encoding a cell surface molecule 
according to the present invention operably linked to second nucleic 
acid sequences directing expression thereof to the dendritic cells or 
transferring a cell surface molecule or a fragment thereof to the den- 
dritic cells; and 

iii) displaying said cell Surface molecules or fragments thereof on the cell 
surface of the dendritic cells; and 

iv) transferring said dendritic cells to the individual to be treated 

Preferably, the dendritic cells are cells derived from the individual to be treated, 
however the dendritic cells may also be derived from another individual. When the 
dendritic cells are derived from another individual, preferably, the cells are derived 
from the same species as the individual to be treated. For example, if the individual 
to be treated is a human being* preferably, the dendritic cells are derived from a 
human being. 

Preferably, the cell surface molecules are displayed on the cell surface as frag- 
ments, such as peptide fragments in the context on MHC molecules. 

Drug target 

Cell surface molecules, which are capable of binding a binding partner and 
internalising said binding partner into cells expressing said cell surface molecule, 
may also be used as drug targets. Preferably, such cell surface molecules are 
expressed at a different level in malignant cell lines compared with normal tissues. 
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More preferably, the cell surface molecules are identified according to the methods 
outlined in the present invention. 

More preferably the cell surface molecule comprises or essentially consists of or for 
5 example is GRIA2, such as LPR8, for example is CHRNA5, such as TMEFF, for 

example is NPTXR, such as Transferrin receptor; such as type II membrane protein 
clone: for example is HP10481; such as type II membrane protein clone: such as 
HP1 0390; for example is PG40; such as TRC8 ; for example is TR2-1 1 ; such as 
OA3 antigenic surface determinant; for example is integrin alpha 6, For example 

10 GPIIb; such as vitronectin receptor alpha subunit; for example is integrin alpha-7; 

such as integrin alpha E precursor; for example is integrin alpha 6B; such as integrin 
alpha 5 subunit; for example is integrin beta-5 subunit; such as integrin alpha-3 
chain; for example is RYK; such as amyloid precursor protein-binding protein 1; for 
example is putative transmembrane GTPase; such as membrane cofactor protein; 

15 FOR EXAMPLE GLVR1 ; for example is Mr 1 1 0,000 antigen; for example is 

syndecan-1; such as putative seven transmembrane domain protein; for example is 
LCA-homolog/ LAR protein; such as M6 antigen; for example is Me491/CD63 
antigen; such as multispanning membrane protein; for example is DDR; such as 
autocrine motility factor receptor; for example is insulin receptor precursor; such as 

20 IGF1R, for example is insulin-like growth factor II receptor; such as SAS; for 
example is TAPA-1; such as MICB; for example is MHC class II HLA-DR7- 
associated glycoprotein beta-chain; such as H LA-DP; for example is bone small 
proteoglycan I biglycan; such as CAR; for example is MEA1 1; such as interferon- 
gamma receptor alpha chain; for example is Polymeric immunoglobulin receptor; 
. 25 such as metabotropic glutamate receptor type 4; for example is metabotropic 
glutamate receptor 8; such as CLPTM1; for example is MAGE-4b; such as 
MAGE5a; for example is MAGE-3; such as MAGE-1 ; for example is MAGE6; such 
as MAGE-9; for example is MAGE11; such as CD24; for example is CD59; such as 
CD44; for example is low density lipoprotein receptor; such as very low density 

30 lipoprotein receptor; for example is N-CAM; such as lamin B receptor homolog 

TM7SF2; for example is putative T1/ST2 receptor binding protein precursor; such as 
NTR2 receptor; for example is RAGE-4; such as HLA-G1; for example is MOAT-C; 
such as alpha 2 delta calcium channel subunit isoform I; for example is LFA-3; such 
as L1-CAM; for example is AVPR2; such as C1 p1 15 C1; for example is TE2; such 

35 as RbP; for example is HCF1 ; such as IRAK; for example is CD151 ; such as surface 
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antigen; for example is MAG; such as GPR19; for example is pcta-1; such as 
PRAME; for example is vasopressin activated calcium mobilizing receptor-like 
protein; such as serotonin receptor 5-HT4B; for example is serotonin 1D receptor (5- 
HT1D-); such as CD9; for example is LDL receptor member LR3; such as DR6; for 
5 example is tumor necrosis factor receptor; such as HG38; for example is urokinase- 
type plasminogen receptor; such as FGF receptor; for example is nerve growth 
factor receptor; such as cystine/glutamate transporter; for example is CB1 
cannabinoid receptor (CNR1); such as PSG; for example is PSG1 3'; such as CPE- 
receptor; for example is CRH2R; such as OCI5; for example is TRAIL receptor 2; 

10 such as HNMP-1 ; for example is kidney alpha-2-adrenergic receptor; such as 

erythropoietin receptor; for example is chondroitin sulphate proteoglycan versican 
V1; for example is mGluRlbeta; such as CD97; for example is L6; such as NY-ESO- 
1 ; for example is T-cell receptor alpha delta; such as rorl ; for example is ror2; such 
as SSTR2; for example is VESPR; such as IgG Fc receptor; for example is 

15 glutamate receptor subunit GluRC; such as HEK2; for example is PVR; such as 
CEA; for example is CC-chemokine-binding receptor JAB61; such as HER2; for 
example is HER3; such as hypothetical protein FU22357 similar to Epidermal 
. growth factor receptor-related protein; for example is putative endothelin receptor 
type B-like protein; such as GLVR2; for example is P2X4 purinoreceptor; such as 

20 FPRL1 ; for example is Atrial natriuretic peptide clearance receptor; for example is 
gastrin/CCK-B receptor; such as Neuromedin B receptor; for example is GFRA3; 
such as GRPR; for example is CDH1; such as CDH2; for example is TGFBR1 ; such 
as TGFBR2; for example is TGFBR3; such as precursor of epidermal growth factor 
receptor. 

25 

Yet more preferably, the cell surface molecule is selected from the group consisting 
of NCAM1, NPTXR, LRP8, CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, TNFRSF12, 
L1CAM, GPR49 and TMEFF1 . 

30 A drug target within the scope of the present invention is a molecule, which can be 
used as a bait, to identify molecules that associates with the drug target and 
accordingly are potential candidates for drugs. Especially such drugs can be used in 
the treatment of a premalignant and/or malignant conditions, when formulated 
accordingly. 



35 
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The present invention furthermore is concerned with methods for identifying novel 
drug targets, which can interact with the binding partners according to the present 
invention (see herein below). 

5 Preferably, such a novel drug target comprise a polypeptide, which is a cell surface 
molecule expressed at a different level in malignant cells compared with normal 
cells. 

Additionally, the present invention also is concerned with the novel drug targets 
10 identified by the above methods. . 



Methods to identify binding partners 

The present invention furthermore provides methods of identifying specific binding 
15 partners. Additionally, the invention provides methods of preparing specific binding 
partners. 

A specific binding partner may be identified/prepared by a number of different 
methods. Any suitable method known to the person skilled in the art may be used 
20 with the present invention depending of the specific embodiment. 

In one embodiment of the present invention, the binding partner is prepared by 
standard methods for preparing specific antibodies. For example such a method 
may involve the following steps: 

25 

i) immunising an animal with said cell surface molecule or a fragment of 
said cell surface molecule; and 

ii) obtaining antibodies from said animal; or 

iii) obtaining cells producing antibodies from said animal and obtaining 
30 antibodies from said cells 

The animal to be immunised may be any animal, preferably a mammal, more 
preferably the animal is selected from the group comprising rabbit, mouse, rat, 
donkey, goat and sheep. . 
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The antibodies are preferably obtained from a serum of the immunised animal. They 
may be purified by any standard method, such as for example by affinity 
chromatography. Antibodies thus obtained are preferably polyclonal antibodies. 

5 

Cell producing antibodies are preferably obtained from the spleen of the immunised 
animals, preferably the cells are B-cells. The antibody producing cells may be fused 
with other cells subsequent to purification from the animal, in order to obtain 
immortal cells. The cells may be cultivated in vitro and antibodies may for example 
10 be recovered from the tissue culture supernatant by any standard method such as 
for example affinity chromatography, or protein A or protein G chromatography. 
These antibodies are often monocional antibodies. 

Subsequently, the antibodies may be humanised by any suitable method known to 
15 the person skilled in the art. 

Antibodies may however also be prepared or identified by other means. For 
example naturally occurring antibodies may be purified from any suitable animal 
including a human being. Antibodies may also be obtained from an expression 
20 library (see herein below). 

In another embodiment of the present invention the binding partner consists of or 
comprises a polypeptide, which may be identified by screening an expression 
library. Any suitable expression library may be used with the present invention. 

25 

The library may be contained within any suitable host cells, for example the host 
cells may be bacterial cells, yeast cells, insect cells or mammalian cells. The library 
may contain nucleic acid sequences encoding polypeptides and/or oligopeptides 
derived from any species, for example viruses, bacteria, yeast, fungi, plants or 
30 animal. Animals may be any animal, preferably mammals, more preferably human 
beings. The library may also contain nucleic acid sequences encoding polypeptides 
and/or oligopeptides, which are synthetic and not naturally occurring. The nucleic 
acid sequences may be contained within any suitable vector, for example a plasmid, 
a virus, a. virus derived vector, a phage, an artificial chromosome or a cosmid. 
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For example the binding partners may be selected from an expression library 
expressing polypeptides and/or oligopeptides. They may also be selected from a 
synthetic combinatorial library expressing polypeptides and/or oligopeptides. 

5 The binding partner may furthermore be identified by screening a phage display 
library of antibodies. Preferably the phage display library is a library of human 
antibodies. 

In yet another embodiment the binding partners are selected from a library of small 
10 chemical compounds. Such a library may comprise any number of different chemical 
compounds, which may be produced, by a number of different reactions. Suitable 
libraries such as for example combinatorial libraries are known to the person skilled 
in the art. 

15 Once a specific binding partner, which can associate with a cell surface molecule or 
a fragment of a cell surface molecule has been identified/prepared, such a binding 
partner should preferably be tested for capability of being internalised. Such tests 
can be performed in a number of suitable ways depending on the nature of the 
binding partner. 

20 

For example, the binding partner may be incubated with cells expressing the cell 
surface molecule or fragment thereof, with which the binding partner can associate. 
Following incubation, the presence and/or absence of the binding partner in the cell 
interior may be detected. Detection may for example be performed taking advantage 

25 . of that the binding partner may have been labelled with a directly or indirectly 
detectable label. Alternatively, the presence of the binding partner may be 
determined using a first species which can interact specifically with the binding 
partner. Such a species may be directly or indirectly labelled or it may be detected 
using a second species, which can interact specifically with the first species and 

30 which may be labelled. It is possible to used a third species, which can interact with 
the second, forth which can interacts with the third and so forth. 
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Binding partners 

5 The specific binding partners according to the present invention are capable of 
interacting with at least one cell surface molecule. However, a specific binding 
• partner may be capable of associating with more than one different cell surface 
molecules. 

10 In one embodiment of the present invention binding partners within are preferably 

binding partners, which are capable of being internalised into a cell expressing a cell 
surface molecule following association with the cell surface molecule. 

The binding partners according to the present invention may be identified by any of 
15 the methods outlined herein above. However, the binding partner may also be 
identified by any other method known to the person skilled in the art. 

Preferably, the binding partner according to the present invention is capable of as- 
sociating with one or more cell surface molecules selected from the group consisting 
20 of receptors which belong to one of the following groups: 

Members of receptor tyrosine kinases 
Members of the integrin family 

Members of the immunoglobulin superfamily adhesion molecules 
25 Members of the heparan sulfate proteoglycan family 

Members of the chondroitin sulfate proteoglycan family 

Members of the MAGE family 

Members of the RAGE family 

Members of the low density lipoprotein receptor family 
30 Members of the cadherin adhesion molecules 

Members of the metabotropic glutamate receptors 

Members of the steroid hormone families 

Members pf the seven transmembrane receptor family 
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Atrial natriuretic peptide clearance receptor 
GFRA3 

Transferrin receptor 

Members of the serine/threonine kinase receptors 

5 

More preferably, the binding partner according to the present invention is capable of 
associating with one or more cell surface molecules selected from the group 
consisting of NCAM1 , NPTXR, LRP8, CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, 
TNFRSF12, L1 CAM, GPR49 an.d TMEFF1. 

10 

Yet more preferably, the binding partner may associate with one or more fragments 
of a cell surface molecule. Preferred fragments of cell surface molecules are 
outlined herein above. Most preferably, the fragments of the cell surface molecules 
are derived from the extracellular part of the cell surface molecule. 

15 

It is preferred that the binding partners according to the present invention may be 
used in pharmaceutical compositions for the treatment of a premalignant and/or 
malignant conditions. 

20 In one embodiment of the present invention the binding partner comprises or 

essentially consists of a polypeptide or an oligopeptide. A polypeptide and/or an 
oligopeptide according to the present invention may be naturally occurring or it may 
be a synthetic polypeptide. v 

25 * In one preferred embodiment the binding partner is an antibody or a fragment of an 
antibody. The antibody may be a polyclonal antibody or a binding fragment thereof 
or it may be a monoclonal antibody or a binding fragment thereof. 

The antibody may be derived from an animal, preferably a mammal, more preferably 
30 a mammal selected from the group consisting of rat, rabbit, mouse, human, donkey, 
goat and sheep.-ln one embodiment the binding partner is a monoclonal antibody 
derived from a mouse or a rat, for example the binding partner is a murine 
monoclonal antibody. 
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The antibody may however also be combinatorial antibody, such as one part of the 
antibody is derived from one species and the other part is derived from another 
species. Furthermore, the antibody may be a synthetic antibody, which is not 
naturally produced. 

5 

For many purposes it is preferred that the antibody is a humanised antibody. 
Especially, if the binding partner should be used for the treatment of a premalignant 
and/or malignant conditions in a human being, it is desirable that the antibody is 
humanised. 

10 

The antibody may also be human antibody. A human antibody may be a naturally 
produced human antibody or it may be identified from a phage display library. 
Furthermore it may be a combinatorial antibody that also comprise parts derived 
from human antibodies, for example identified from a combinatorial library. Such an 
1 5 antibody need no further humanisation. 

The antibody preferably, may interact with the extracellular part of the cell surface 
molecule (see herein above). The antibody may also associate with a 
posttranslational modification of the extracellular part of the cell surface molecule. 
20 Alternatively, the antibody may interacts with any of the fragments of the cell surface 
molecule as outlined herein above. 

Most preferably the antibody is capable of being internalised upon association with 
said cell surface molecule. M.any antibodies, which associate with a cell surface 
25 molecule, are not internalised into a cell expressing the cell surface molecule upon 
association. Preferred antibodies within the scope of the present invention are 
antibodies, which may be internalised into a cell expressing the cell surface 
molecule following association. 

30 In another embodiment of the present invention the binding partner is a naturally 
occurring ligand for said cell surface molecule. A naturally occurring ligand is a 
compound, which under natural conditions associates with the cell surface molecule. 
A naturally occurring ligand may for example selected from the group consisting of 
polypeptides, oligopeptides, hormones, lipids, saccharides, amino acids, 

35 neurotransmitters, nucleotide, nucleoside and combinations thereof. 
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A hormone could for example be a steroid hormone. Steroid hormones may be se- 
lected from the group consisting of androgens, estrogens, progestogens and corti- 
coids. 

5 

Androgens can for example be selected from the group consisting of testosterone, 
dihydrotestosterone, androstenediol, androstenedione, dehydroepiandrosterone 
(DHEA), dehydroepiandrosterone sulfate (DHEA-S) and derivatives thereof. 

1 0 Estrogens can for example be selected from the group consisting of estrion, estra- 
diol, estriol and derivatives thereof. 

A naturally occurring ligand may be purified from an animal including a human being 
by any conventional technique suitable for the ligand of the embodiment. However, 
15 a natural ligand may also be produced in vitro by any method known to the person 
skilled in the art. 

In one embodiment of the present invention the binding partner is a recombinants 
produced ligand for said cell surface molecule. If the ligand is a polypeptide or an . 

20 oligopeptide, the ligand may be produced by transforming a suitable host, such as 
bacteria, yeast, protozoa, animals such as for example mammals, plants, animal 
cells or plant cells with a nucleic acid sequence encoding the ligand operably linked 
to nucleic acid sequences that direct transcriptions and/or translation of the nucleic 
acid sequence in the particular host. Transformation may be performed according to 

25 any conventional technique. Subsequently the ligand may be purified according to 
any standard method. 

In another embodiment the binding partner is a viral protein or comprise a viral 
protein or fragments thereof. A large number of viral proteins are capable of 
30 associating with cell surface molecule. Frequently such an association results in 
internalisation of the virus particle, and hence viral proteins are suitable binding 
partners according to the present invention. 
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Preferably, a viral protein is a viral capsid protein, more preferably the viral capsid 
protein is capable of being internalised into a cell expressing the cell surface 
molecule. 

5 The viral protein may be derived from any virus, preferably a virus which is capable 
of infecting cells which naturally expresses the cell surface molecule. The virus 
could for example be selected from the group consisting of adenovirus, herpes 
simplex virus, influenza virus and members of the lentivirus family. 

10. The binding partner may be recombinantly produced (see herein above) and 
comprise viral capsid protein sequences. Preferably the viral capsid protein 
sequences are the sequences of the viral protein which can associate with the cell 
surface molecule and result in internalisation. 

15 In yet another embodiment of the present invention the binding partner is a small 
chemical compound. Such a small chemical compound is usually synthetically 
produced. It can be produced by any process or combination of processes known to 
the person skilled in the art. 

20 Preferred small chemical compounds can interact with the cell surface molecules 
and/or the fragments of cell surface molecules as outlined herein above. More 
preferably, the small chemical compounds are capable of being internalised into 
cells expressing said cell surface molecules. 

25 In one embodiment the binding partner may be a polypeptide selected from the 

group consisting of EGF, TGF-a, TGF-p, amphiregulin, HB-EGF, epiregulin, beta- 
cellulin, IGF-1, IGF-2, collagen, fibronectin, vitronectin, laminin, amyloid beta-protein 
precursor, interferon y, transferrin, autocrine motility factor, L1, NCAM, cadherin, 
bombesin, neuromedin B, TNF, erythropoietin, interleukin and cholecystokinin B. 

30 Furthermore, the binding partner may for example be an organic compound selected 
from the group consisting of cannabinoid, acetylcholin, dopamine, norepihrine, se- 
rotonin and GABA. In addition the binding partner may for example be an oligopep- 
tide selected from the group consisting of formyl peptide and atrial natriuretic pep- 
tide. Furthermore the binding partner may for example be an amino acid, the binding 

35 partner may be any amino acid, preferably the amino acid is selected from the group 
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consisting of glutamate, glycine and histamine. Additionally, the binding partner may 
for example be a nucleotide selected from the group consisting of ATP and GTP. 
Furthermore, the binding partner may be a hormone such as estrogen, a lipid or a 
saccharide. 

5 

Furthermore, the binding partner according to the present invention may be selected 
from the group consisting of EGF, TGF-a, heregulins, Insulin, IGF-1, PDGF, CSF-1, 
SCF, Flt-3L, VEGF, FGFs1-9, NGF, BDNF, NT-3, NT-4, HGF, MSP, Gas6, Angio- 
poietin-1, ephrinA1-5, ephrinB1-3, GDNF, PEPHC1, TGF-p, Angiotensin, Thrombin, 
10 Adenosine, Adrenalin, Serotonin, deltorphin, Dopamine, PTH, Secretin, VIP, PA- 
CAP, Glucagon, CRF, Bombesin, Bradykinin, NPY, Glutamate, Ca 2 \ GABA, Che- 
mokines and Opioids. 

More preferably, the binding partner may be selected from the group consisting of L- 
15 glutamate, kainate, 5-(bromomethyl)-4-isoxazolepropionic acid (), analogues of glu- 
tamate, substituted quinoxaline 2,3 diones, GYKI52466, 5-l-Willardine, 5-F- 
Willardine, agonist and antagonist ligands to the AMPA ((RS)-a-Amino-3-hydroxy-5- 
methyl-4-isoxazolepropionic acid, NBQX, CNQX, DNQX, GYKI 52466, 6- 
Chlorokynurenic acid, JSTX, L-APA, L-SOP, ACPT, (R,S)-PPG, CPPG, MAP4, (S)- 
20 3,4-DCPG, vitronectin, cytactin, fibronectin, fibrinogen, laminin, MMP-2, osteopon- 
tin, prothrombin, thrombospondin, von Willebrandts Factor, recombinant fragments 
of L1CAM, salmosin, E-cadherin and peptides thereof, including the peptide: 
NRDKETKV, NCAM1 domain Ig Ml, NCAM1 domain Iglll and peptides thereof, 
peptides C3: ASKKPKRNIKA (SEQ ID NO. 305), D3: AKKERQRKDTU (SEQ ID NO 
25 306), D4: ARALNWGAKP (SEQ ID NO 307), monoclonal antibody 123C3, NPTX1 , 
NPTX2, taipoxin,TCBP49, Oxynor, ApoE2, ApoE3, ApoE4, peptides from ApoE 
(E 14 i : -155; LRKLRKRLLRDADDL (SEQ ID NO 308) and its tandem E {14 i ; _i55)2; 
LRKLRKRLLRDADDL-LRKLRKRLL RDADDL (tandem of SEQ ID NO 308)) 
reelin, nicotine, acetylcholine, a-bungarotoxin and carbachol. 

30 

The binding partner according to the present invention should be selected according 
to cell surface molecule employed in the specific embodiment of the invention. 



35 



Hence, in embodiments of the invention, wherein the cell surface molecule is capa- 
ble of internalising a binding partner or a targeting complex, the binding partner is 
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preferably selected from the group consisting of NCAM1 domain Ig l+ll, NCAM1 
domain Iglll and peptides thereof, peptides C3: ASKKPKRNIKA (SEQ ID NO 305), 
D3: AKKERQRKDTU SEQ ID NO 306), D4: ARALNWGAKP (SEQ ID NO 307), 
monoclonal antibody 123C3, NPTX1, NPTX2, taipoxin,TCBP49, Oxynor, ApoE2, 
5 ApoE3, ApoE4, peptides from ApoE (E 14 i;-15 5 ; LRKLRKRLLRDADDL (SEQ ID NO 
308) and its tandem E (14 1H55)2| LRKLRKRLLRDADDL-LRKLRKRLL RDADDL (tan- 
dem of SEQ ID NO 308)) reelin, nicotine, acetylcholine, a-bungarotoxin, carbachol 
and specific internalising antibodies directed against said cell surface molecules. 

10 In embodiments of the invention wherein the cell surface molecule is not capable of 
internalising a binding partner or a targeting complex, the binding partner is prefera- 
bly selected from the group consisting of L-glutamate, kainate, 5-(bromomethyl)-4- 
isoxazolepropionic acid (), analogues of glutamate, substituted quinoxaline 2,3 dio- 
nes, GYKI52466, 5-l-Willardine, 5-F-Willardine, agonist and antagonist ligands to 

1 5 the AMPA ((RS)-a-Amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid, NBQX, 

CNQX, DNQX, GYKI 52466, 6-Chlorokynurenic acid, JSTX, L-APA, L-SOP, ACPT, 
(R,S)-PPG, CPPG, MAP4, (S)-3,4-DCPG, vitronectin, cytactin, fibronectin, fibrino- 
gen, laminin, MMP-2, osteopontin, prothrombin, thrombospondin, von Willebrandts 
Factor, recombinant fragments of L1CAM, salmosin, E-cadherin and peptides 

20 thereof, including the peptide: NRDKETKV (SEQ ID NO 309) and specific antibodies 
directed to said cell surface molecules. 

In one especially preferred embodiment of the present invention the cell surface 
molecule is NCAM1. When the cell surface molecule is NCAM1, then the binding 

25 partner is preferably selected from the group consisting of the first and second Im- 
munoglobulin domains (Ig) of NCAM1 (Kiselyov et a/., 1997), the third Ig domain of 
NCAM1, the adhesion molecule L1 and proteoglycans. Furthermore, the binding 
partner may preferably be selected from the group consisting of synthetic binding 
partners capable of associating with NCAMs including for example a large number 

30 of peptides (1 1 amino acids) identified from a combinatorial peptide library (Ronn et 
a/., 1999), including for example C3: ASKKPKRNIKA (SES ID NO 305), D3: AK- 
KERQRKDTU (SEQ ID NO 306) and D4: ARALNWGAKPK (SEQ ID NO 307)(Ronn 
et a/., 1999). In addition the binding partner may preferably be selected from the 
group consisting of antibodies against NCAM1, preferably monoclonal antibodies 

35 against NCAM1 , for example antibody (123C3) which causes internalisation. 



WO 03/000928 PCT/IB02/03534 

102 

In another preferred embodiment of the invention the cell surface molecule is 
NPTXR. When the cell surface molecule is NPTXR, then the binding is preferably 
selected from the group consisting of Neuronal pentraxin 1 (NP1, NPTX1) and 
5 Neuronal pentraxin 2 (NP2, NPTX2) (Kirkpatrick et a/ M 2000;Dodds et a/., 1997). 
Furthermore, the binding partner may preferably be selected from the group consi- 
sting of the snake venom taipoxin and taipoxin associated calcium-binding protein 
49 (TCBP49) and the taipoxin analogue, Oxynor. In addition the binding partner may 
preferably be selected from the group consisting of antibodies against NPTXR, pref- 
1 0 erably monoclonal antibodies against NPTXR. 

In another preferred embodiment of the invention the cell surface molecule is LRP8. 
When the cell surface molecule is LRP8, then the binding partner is preferably se- 
lected from the group consisting of ApoE2, ApoE3 and ApoE4 and reelin. Further- 

15 more, the binding partner may preferably be selected from the group consisting 
variouys recombinant ApoE isoforms some of which are commercially available. 
However, the natural ApoE isoforms are capable of associating with several recep- 
tors. In addition, the binding partner may preferably be selected from the group con- 
sisting peptides from ApoE, for example (Emihks; LRKLRKRLLRDADDL (SEQ ID 

20 NO 308) and its tandem E (14 i : _i55)?; LRKLRKRLLRDADDL-LRKLRKRLL RDADDL), 
both have been shown to inhibit receptor function (Riddell et a/., 1 999). In addition 
the binding partner may preferably be selected from the group consisting of anti- 
bodies against LRP8, preferably monoclonal antibodies against LRP8. 

25 In another preferred embodiment of the invention the cell surface molecule is 

CHRNA5. When the cell surface molecule is CHRNA5, then the binding partner is 
preferably selected from the group consisting of nicotine, acetylcholine and the toxin 
a-bungarotoxin. Furthermore, the binding partner may be selected from the group 
consisting of synthetic agonists of CHRNA5, for example carbachol. In addition the 

30 binding partner may preferably be selected from the group consisting of antibodies 
against CHRNA5, preferably monoclonal antibodies against CHRNA5. 



35 



In another preferred embodiment of the invention the cell surface molecule is 
L1CAM. When the cell surface molecule is L1CAM, then the binding partner may for 



WO 03/000928 PCT/IB02/03534 

103 

example cornprisean adhesion molecule of the integrin family or a fragment thereof. 
L1CAM is known to bind several adhesion molecules of the integrin family through 
an RGD sequence and of the immunoglobulin family via an oligomannosidic carbo- 
hydrate. In addition the binding partner may preferably be selected from the group 
5 consisting of antibodies against L1 CAM, preferably monoclonal antibodies against 
L1CAM. 

In another preferred embodiment of the invention the cell surface molecule is 
TNFRSF12. When the cell surface molecule is TNFRSF12, then the binding partner 
1 0 may for example an antibody against TNFRSF1 2, preferably a monoclonal antibody 
against TNFRSF12, for example a monoclonal antibody to the extracellular domain 
of TNFRSF12. 

In one especially preferred embodiment of the present invention the cell surface 

15 molecule is GRIA2. When the cell surface molecule is GRIA2, then the binding part- 
ner is preferably selected from the group consisting of L-glutamate and kainate. 
Furthermore, the binding partner may preferably be selected from the group con- 
sisting of synthetic ligands to GRIA2, for example agonist and antagonist ligands to 
the AMPA ((RS)-a-Amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptors. 

20 AMPA receptor ligands are generally either analogues of glutamate or substituted 

quinoxaline 2,3 diones. The antagonists are divided into competitive and modulatory 
site antagonists (reviewed in (Brauner-Osborne ef a/., 2000;Madsen et a/., 2001)). In 
addition, one AMPA antagonist, GYKI52466, has been shown to inhibit tumor cell 
growth (Cavalheiro and Olney, 2001) on cells expressing the GRIA2 receptor. As 

25 the receptor binding of the ligands involves binding of the mayor portions of the li- 
gand, substitutions (such as halogens) can only be performed at very few sites. A 
brome substituted form of AMPA (ABPA) was shown to act as a potent agonist of 
AMPA receptors (Krogsgaard-Larsen ef a/., 1985). The agonist may also be a halo- 
genated form of an agonist, for example williardiine and analogues with different 

30 AMPA receptor affinities (Jane.D, 2001). Many of these show many fold higher af- 
finity for AMPA receptors than AMPA itself. Synthesis of Williardiine and 6- 
azowillardiine halogenated analogues is described in detail in (Jane 'et a/., 1997). 5- 
l-Willardine and 5-F-Willardine are commercially available, also in a 3 H- forms. 
In addition the binding partner may selected from the group consisting of small 

35 molecule antagonists, for example the commercially available NBQX, CNQX, 
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DNQX, GYKI 52466 and 6-Chlorokynurenic acid and the group of larger polyamine 
antagonists of AMPA receptor channels related to the spider toxin JSTX-3 (Yoneda 
et a/., 2001 ). In addition the binding partner may preferably be selected from the 
group consisting of antibodies against GRIA2, preferably monoclonal antibodies 
5 against GRIA2. 

In another preferred embodiment of the invention the cell surface molecule is 
GRM8. When the cell surface molecule is GRM8, then the binding partner may pre- 
ferably be L*glutamate. Furthermore, the binding partner may preferably be selected 

10 from the group consisting of agonists and antagonists, for example the commercially 
available L-APA, L-SOP, ACPT, (R,S)-PPG, CPPG, MAP4, (S)-3 ( 4-DCPG and 
MSOP and their 3 H labelled forms. One agonist, (R.S)-PPG has a 25 fold preference 
for GRM8 (Gasparini et al, 1999) and the agonist (S)-3,4-DCPG displays more than 
1 00 fold selectivity for GRM8 (Bruno et ai , 2001 ;Thomas et al. , 2001 Turner and 

15 Salt, 1999). In addition the binding partner may preferably be selected from the 
group consisting of antibodies against GRM8, preferably monoclonal antibodies 
against GRM8. 

In another preferred embodiment of the invention the cell surface molecule is IT- 

20 GAV. When the cell surface molecule is ITGAV, then the binding partner may prefe- 
rably be selected from the group consisting of vitronectin, cytactin, fibronectin, fi- 
brinogen, laminin, MMP-2, osteopontin, prothrombin, thrombospondin von Wille- 
brandts Factor and ctvp3. <xvp3 has been shown to bind recombinant fragments of 
the neural cell adhesion molecule L1 though the av subunit (Montgomery et a/., 

25 1 996).The natural ligands, such as vitronectin, are also ligands for a number of 

other, ubiquitously expressed integrins and therefore not optimal for specific target- 
ing. Furthermore, the binding partner may preferably be selected from the group 
consisting of disintegrins and ADAMs, for example salmosin or contortrostatin. Dis- 
integrins and ADAMs (A Disintegrin and A Metalloprotease) are a large number of 

30 proteins from snake venoms, which bind with different specificities to different inte- 
grins (Evans, 2001;Huang, 1998). Several disintegrins specific for avp3 and avp5 
have been identified, including recombinantly produced salmosin (Kang et a/., 1999) 
and contortrostatin (Mercer et a/., 1998). In addition, the binding partner may prefer- 
bly be selected from the group consisting of small cyclic peptides and non-peptide 

35 compounds, which are antagonists of <xvp3 binding (Boger et a/., 2001 ;Hartman and 
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Duggan, 2000;Kerr et a/., 2000;Batt ef a/., 2000). In addition the binding partner may 
preferably be selected from the group consisting of antibodies against ITGAV, pref- 
erably monoclonal antibodies against ITGAV. 

5 In another preferred embodiment of the invention the cell surface molecule is IT- 

GAE. When the cell surface molecule is ITGAE, then the binding partner may prefe- 
rably be the cell adhesion molecule E-cadherin or a fragment thereof. The hetero- 
philic binding site on E-cadherin for aEp7 differs from the homophilic binding site of 
E-cadherin with another E-cadherin (Karecla et a/., 1996;Taraszka et a/., 2000). 

10 Preferably the fragment comprises or even more preferably consists of a short pep- 
tide sequence from the first domain of E-cadherin (amino acids 27-34: NRDKETKV 
(SEQ ID NO 309) f which are capable of interfering with the binding of aE(J7 to E- 
cadherin. Furthermore, the binding partner may be selected from the group consist- 
ing of specific aEp7 specific peptides (Brenner and Cepek, 2001). In addition the 

15 binding partner may preferably be selected from the group consisting of antibodies 
against ITGAE, preferably monoclonal antibodies against ITGAE, such as aE spe- 
cific antibodies that may be used as antagonists. 



Complex 

20 

In one embodiment the present invention relates to a complex comprising a cell 
surface molecule and a binding partner. Preferably the cell surface molecule is 
identified by the method disclosed by the present invention. Preferably, the cell 
surface molecule comprises or essentially consists of or for example is GRIA2, such 

25 as LPR8, for example is CHRNA5, such as TMEFF, for example is NPTXR, such as 
Transferrin receptor; such as type II membrane protein clone: for example is 
HP10481; such as type II membrane protein clone: such as HP10390; for example 
is PG40; such as TRC8 ; for example is TR2-1 1; such as OA3 antigenic surface 
determinant; for example is integrin alpha 6, For example GPIIb; such as vitronectin 

30 receptor alpha subunit; for example is integrin alpha-7; such as integrin alpha E 
precursor; for example is integrin alpha 6B; such as integrin alpha 5 subunit; for 
example is integrin beta-5 subunit; such as integrin alpha-3 chain; for example is 
RYK; such as amyloid precursor protein-binding protein 1; for example is putative 
transmembrane GTPase; such as membrane cofactor protein; FOR EXAMPLE 
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GLVR1 ; for example is. Mr 1 1 0,000 antigen; for example is syndecan-1 ; such as 
putative seven transmembrane domain protein; for example is LCA-homolog/ LAR 
protein; such as M6 antigen; for example is Me491/CD63 antigen; such as 
multispanning membrane protein; for example is DDR; such as autocrine motility 
5 factor receptor; for example is insulin receptor precursor; such as IGF1 R, for 

example is insulin-like growth factor II receptor; such as SAS; for example is TAPA- 
1; such as MICB; for example is MHC class II HLA-DR7-associated glycoprotein 
beta-chain; such as HLA-DP; for example is bone small proteoglycan I biglycan; 
such as CAR; for example is MEA1 1 ; such as interferon-gamma receptor alpha 

10 chain; for example is Polymeric immunoglobulin receptor; such as metabotropic 

glutamate receptor type 4; for example is metabotropic glutamate receptor 8; such 
as CLPTM1; for example is MAGE-4b; such as MAGE5a; for example is MAGE-3; 
such as MAGE-1; for example is MAGE6; such as MAGE-9; for example is 
MAGE1 1; such as CD24; for example is CD59; such as CD44; for example is low 

15 density lipoprotein receptor; such as very low density lipoprotein receptor; for 
example is N-CAM; such as lamin B receptor homolog TM7SF2; for example is 
putative T1/ST2 receptor binding protein precursor; such as NTR2 receptor; for 
example is RAGE-4; such as HLA-G1; for example is MOAT-C; such as alpha 2 
delta calcium channel subunit isoform I; for example is LFA-3; such as L1-CAM; for 

20 example is AVPR2; such as C1 p1 15 C1; for example is TE2; such as RbP; for 

example is HCF1; such as IRAK; for example is CD151; such as surface antigen; for 
example is MAG; such as GPR19; for example is pcta-1; such as PRAME; for 
example is vasopressin activated calcium mobilizing receptor-like protein; such as 
serotonin receptor 5-HT4B; for example is serotonin 1D receptor (5-HT1D-); such 

25 as CD9; for example is LDL receptor member LR3; such as DR6; for example is 
tumor necrosis factor receptor; such as HG38; for example is urokinase-type 
plasminogen receptor; such as FGF receptor; for example is nerve growth factor 
receptor; such as cystine/glutamate transporter; for example is CB1 cannabinoid 
receptor (CNR1); such as PSG; for example is PSG13'; such as CPE-receptor; for 

30 example is CRH2R; such as OCI5; for example is TRAIL receptor 2; such as HNMP- 
1 ; for example is kidney alpha-2-adrenergic receptor; such as erythropoietin 
receptor; for example is chondroitin sulphate proteoglycan versican V1; for example 
is mGluRlbeta; such as CD97; for example is L6; such as NY-ESO-1; for example 
is T-cell receptor alpha delta; such as rorl ; for example is ror2; such as SSTR2; for 

.35 example is VESPR; such as IgG Fc receptor; for example is glutamate receptor 
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subunit GluRC; such as HEK2; for example is PVR; such as CEA; for example is 
CC-chemokine-binding receptor JAB61; such as HER2; for example is HER3; such 
as hypothetical protein FLJ22357 similar to Epidermal growth factor receptor-related 
protein; for example is putative endothelin receptor type B-like protein; such as 
5 GLVR2; for example is P2X4 purinoreceptor; such as FPRL1 ; for example is Atrial 
natriuretic peptide clearance receptor; for example is gastrin/CCK-B receptor; such 
as Neuromedin B receptor; for example is GFRA3; such as GRPR; for example is 
CDH1; such as CDH2; for example is TGFBR1; such as TGFBR2; for example is 
TGFBR3; such as precursor of epidermal growth factor receptor. 

10 

More preferably the cell surface molecule may be selected from the group consisting 
of 

Members of receptor tyrosine kinases 

Members of the integrin family 
15 Members of the immunoglobulin superfamily adhesion molecules 

Members of the heparan sulfate proteoglycan family 

Members of the chondroitin sulfate proteoglycan family 

Members of the MAGE family 

Members of the RAGE family 
20 Members of the low density lipoprotein receptor family 

Members of the cadherin adhesion molecules 

Members of the metabotropic glutamate receptors 

Members of the steroid hormone families 

Members of the seven transmembrane receptor family 
25 Atrial natriuretic peptide clearance receptor 

GFRA3 

Transferrin receptor 

Members of the serine/threonine kinase receptors 

30 More preferably, the cell surface molecule is selected from the group consisting of 
NCAM1, NPTXR, LRP8, CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, TNFRSF12, 
L1CAM, GPR49 and TMEFF1. 
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The binding partner of the complex may be any specific binding partner capable of 
interacting with the cell surface molecule. Examples of binding partners are given 
hereinabove. 

5 Targeting complex 

The present invention provides targeting complexes, which comprise a binding 
partner and a bioreactive species. The binding partner should be capable of 
associating with one or more cell surface molecules or fragments thereof as outlined 
10 hereinabove. 

In one preferred embodiment of the present invention, the cell surface molecule, 
which can associate with the binding partner of the targeting complex, is capable of 
internalising the targeting complex. However, in another preferred embodiment of 
15 the present invention, the cell surface molecule is not capable of internalising the 
targeting complex, but merely is capable of associating with the targeting complex. 

More preferably, the cell surface molecule comprises or essentially consists of or for 
example is Transferrin receptor; such as type II membrane protein clone: for exam- 
20 pie is HP10481; such as type II membrane protein clone: such as HP10390; for ex- 
ample is PG40; such as TRC8 ; for example is TR2-1 1 ; such as OA3 antigenic sur- 
face determinant; for example is integrin alpha 6, For example GPIIb; such as vitro- 
nectin receptor alpha subunit; for example is integrin alpha-7; such as integrin alpha 
E precursor; for example is integrin alpha 6B; such as integrin alpha 5 subunit; for 
. 25 example is integrin beta-5 subunit; such as integrin alpha-3 chain; for example is 
RYK; such as amyloid precursor protein-binding protein 1; for example is putative 
transmembrane GTPase; such as membrane cofactor protein; FOR EXAMPLE 
GLVR1 ; for example is Mr 1 1 0,000 antigen; for example is syndecan-1 ; such as 
putative seven transmembrane domain protein; for example is LCA-homolog/ LAR 
30 protein; such as M6 antigen; for example is Me491/CD63 antigen; such as mul- 

tispanning membrane protein; for example is DDR; such as autocrine motility factor 
receptor; for example is insulin receptor precursor; such as IGF1 R, for example is 
insulin-like growth factor II receptor; such as SAS; for example is TAPA-1; such as 
MICB; for example is MHC class II HLA-DR7-assbciated glycoprotein beta-chain; 
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such as HLA-DP; for example is bone small proteoglycan I biglycan; such as CAR; 
. for example is MEA1 1 ; such as interferon-gamma receptor alpha chain; for example 
is Polymeric immunoglobulin receptor; such as metabotropic glutamate receptor 
type 4; for example is metabotropic glutamate receptor 8; such as CLPTM1 ; for ex- 
5 ample is MAGE-4b; such as MAGESa; for example is MAGE-3; such as MAGE-;1 ; 
for example is MAGE6; such as MAGE-9; for example is MAGE1 1 ; such as CD24; 
for example is CD59; such as CD44; for example is low density lipoprotein receptor; 
such as very low density lipoprotein receptor; for example is N-CAM; such as lamin 
B receptor homolog TM7SF2; for example is putative T1/ST2 receptor binding pro- 

10 tein precursor; such as NTR2 receptor; for example is RAGE-4; such as HLA-G1 ; for 
example is MOAT-C; such as alpha 2 delta calcium channel subunit isoform I; for 
example is LFA-3; such as L1-CAM; for example is AVPR2; such as C1 p1 15 C1; 
for example is TE2; such as RbP; for example is HCF1 ; such as IRAK; for example 
is CD151; such as surface antigen; for example is MAG; such as GPR19; for exam- 

15 pie is pcta-1 ; such as PRAME; for example is vasopressin activated calcium mobi- 
lizing receptor-like protein; such as serotonin receptor 5-HT4B; for example is sero- 
tonin 1D receptor (5-HT1D-); such as CD9; for example is LDL receptor member 
LR3; such as DR6; for example is tumor necrosis factor receptor; such as HG38; for 
example is urokinase-type plasminogen receptor; such as FGF receptor; for exarn- 

20 pie is nerve growth factor receptor; such as cystine/glutamate transporter; for exam- 
ple is CB1 cannabinoid receptor (CNR1); such as PSG; for example is PSG13'; 
such as CPE-receptor; for example is CRH2R; such as OCI5; for example is TRAIL 
receptor 2; such as HNMP-1 ; for example is kidney alpha-2-adrenergic receptor; 
such as erythropoietin receptor; for example is chondroitin sulphate proteoglycan 

25 versican V1 ; for example is mGluRI beta; such as CD97; for example is L6; such as 
NY-ESO-1 ; for example is T-cell receptor alpha delta; such as rorl ; for example is 
ror2; such as SSTR2; for example is VESPR; such as IgG Fc receptor; for example 
is glutamate receptor subunit GluRC; such as HEK2; for example is PVR; such as 
CEA; for example is CC-chemokine-binding receptor JAB61 ; such as HER2; for ex- 

30 ample is HER3; such as hypothetical protein FLJ22357 similar to Epidermal growth 
factor receptor-related protein; for example is putative endothelin receptor type B- 
like protein; such as GLVR2; for example is P2X4 purinoreceptor; such as FPRL1 ; 
for example is Atrial natriuretic peptide clearance receptor; for example is gas- 
trin/CCK-B receptor; such as Neuromedin B receptor; for example is GFRA3; such 

35 as GRPR; for example is CDH1 ; such as CDH2; for example is TGFBR1 ; such as 
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TGFBR2; for example is TGFBR3; such as precursor of epidermal growth factor 
receptor, yet more preferably the cell surface molecule is selected from the group 
consisting of receptors which belong to one of the following groups: 

5 Members of receptor tyrosine kinases 
Members of the integrin family 

Members of the immunoglobulin superfamily adhesion molecules 

Members of the heparan sulfate proteoglycan family 

Members of the chondroitin sulfate proteoglycan family 
1 0 Members of the MAGE family 

Members of the RAGE family 

Members of the low density lipoprotein receptor family 

Members of the cadherin adhesion molecules 

Members of the metabotropic glutamate receptors 
1 5 Members of the steroid hormone families 

Members of the seven transmembrane receptor family 

Atrial natriuretic peptide clearance receptor 

GFRA3 

Transferrin receptor 
20 Members of the serine/threonine kinase receptors 

Even more preferably, the cell surface molecules selected from the group consisting 
of NCAM1, NPTXR, LRP8, CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, TNFRSF12, 
L1 CAM, GPR49 and TMEFF1 . 

25 

The bioreactive species according to the present invention may be any species, 
which can directly or indirectly exert a biological influence on a target cell, wherein 
the target cell, is any cell expressing the cell surface molecule and which can 
30 internalise the targeting construct. The biological influence according to the present 
invention may for example be selected from the group consisting of cell cycle arrest, 
protection of cell against toxins and cell death. 
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The bioreactive species may any compound for example it may be a nucleic acid 
sequence, a polypeptide, an oligopeptide, a toxin, a small chemical compound or a 
radioactive isotope. 

5 In one preferred embodiment the bioreactive species is a nucleic acid sequence. 
Preferably, the nucleic acid sequence comprises a second nucleic acid operably 
linked to a first nucleic acid sequence comprising an expression signal. 

The second nucleic acid sequence may in one preferred embodiment encode a 
10 therapeutic protein (see herein below). The nucleic acid sequence encoding a 

therapeutic protein may comprise complementary DNA (cDNA). The term "cDNA" 
used here, is intended to refer to DNA prepared using messenger RNA (mRNA) as 
template. The advantage of using a cDNA, as opposed to genomic DNA or DNA 
polymerised from a genomic DNA or non- or partially-processed RNA template, is 
15 that the cDNA does not contain any non-coding intron sequences but, rather 

comprise the uninterrupted coding region of the corresponding protein. There may 
be times when the full or partial genomic sequence is preferred, however, such as 
where the non-coding regions are required for optimal expression. 

20 In another embodiment the second nucleic acid sequence encodes an antisense 
RNA or part of an antisense RNA. Alternatively, the second nucleic acid sequence 
may comprise or essentially consist of an antisense RNA or part of an antisense 
RNA. 

25 In the context of the present invention the term "antisense RNA" is intended to en- 
compass an RNA sequence transcribed from the non-coding DNA strand of a gene 
or an RNA sequence that is capable of hybridising to an mRNA or fragments thereof 
under stringent conditions. 

30 Preferably, the antisense RNA within the context of the present invention is the an- 
tisense RNA of a gene encoding a protein, which promotes cell survival, cell growth 
and/or cell mobility. More preferably, the antisense RNA is the antisetnse RNA of an 
oncogene or a growth factor. 
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In another embodiment the second nucleic acid sequence encodes or comprises a 
ribozyme. A ribozyme within the present context is a molecule, which comprises at 
least one RNA, which comprises an enzymatic activity. Preferably, ribozymes 
according to the present invention is targeted against RNA of an oncogene or a 
5 protooncogene or growth factors. 

Accordingly, in preferred embodiments of the present invention antisense RNAs or 
ribozymes are targeted against RNA of an oncogene or proto-oncogene or growth 
factors. Examples of growth factors are indicated herein below. 

10 

Oncogenes are a diverse class of genes, whose products may contribute to the 
development and/or advancement of cancer. Proto-oncogenes may under certain 
circumstances or after due to mutations contribute to the development and/or 
advancement of cancer. Oncogene or proto-oncogene may for example be selected 
15 from the group consisting of Ras, Raf, Myc, Syn, Pirn, BMM, FOP, Sis, KGF, Fms, 
Fig, Neu, Trk, Kit, Met, Src, Fyn, Mas, Fes/Fps, Tre, Mer, ABL, BCL3, int-2, Cym, 
Ets, Elk, RhoA, Ski, Wnt-5a, Spi-1, Rap2, p55 and c-tyr. This is not an exhaustive 
list of oncogenes and proto-oncogenes, which may be used with the present 
invention, but merely comprises illustrative examples. 

20 

The second nucleic acid sequences may also encode a tumour suppressor gene to 
be introduced into the cell expressing the cell surface molecule in order to correct 
any endogenous mutations of said tumour suppressor within the cell. The tumour 
suppressor may be any tumour suppressor for example any of the tumour suppres- 
25 sors indicated herein below. 

The first nucleic acid sequences according to the present invention preferably 
comprise an expression signal. Such an expression signal should preferably 
influence the transcription of second nucleic acid sequences operably linked thereto, 
30 Preferably, the first nucleic acids sequences according to the present invention 
influence transcription such as they enhance transcription under specific 
circumstances. 

In one embodiment of the present invention the first nucleic acid sequence 
35 comprises an expression signal, which directs a lower level of expression of a 
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second nucleic acid sequence in malignant cells, compared with non-malignant 
cells. In another embodiment the first nucleic acid sequence comprises an 
expression signal, which directs approximately the same level of expression of a 
second nucleic acid sequence in malignant cells, compared with non-malignant 
5 cells. 

However, in a preferred embodiment of the present invention the first nucleic acid 
sequences directs a higher level of expression of a second nucleic acid sequence in 
malignant cells compared with non-malignant cells. In particular, the first nucleic 
10 acid sequences may be selected from the group consisting of first nucleic acid 
sequences identified according to the methods outlined herein above. 

Preferably, the first nucleic acid sequence is selected from the group consisting of 
prol, pro2, pro3, pro4, pro5, pro6, pro7, pro8, pro9, pro10, pro12, pro13, pro14 f 

15 pro15, pro16, pro17, pro18, pro19, pro20, pro21, pro22, pro23, pro24, pro25, pro26, 
pro27, pro28, pro29, pro30, pro31, pro32, pro34, pro36, pro37, pro38, pro39, pro40, 
pro41, pro42, pro43, pro44, pro45, pro46, pro47, pro48 r pro49, pro50, pro51, pro52, 
pro53, pro54, pro55, pro56, pro57, pro58, pro59, pro60, pro61, pro62, pro63, pro64, 
pro65, pro66, pro67, pro68, pro69, pro70, pro71, pro72, pro73, pro74, pro75, pro76, 

20 pro77, pro78, pro79, pro80, pro81, pro82, pro83, pro84, pro85, pro86, pro87, pro88, 
pro89, pro90, pro91, pro92, pro93, pro94, pro95, pro96, pro97, pro98, pro99, 
pro100, pro101, pro103, pro104, pro105, pro106, pro107, pro108, pro109, pro110, 
pro111, pro112, pro113, pro114, pro115, pro116, pro117, pro118, pro119, pro120, 
pro121, pro122, pro123, pro124, pro125, pro126, pro127, pro128, pro129, pro130, 

25 pro131, pro133, pro134, pro135, pro136, pro137, pro138, pro139, pro140, pro141, 
pro142, pro143, pro144, pro145, pro146, pro147, pro148, pro149, pro150, pro152, 
pro153, pro154, pro155, pro156, pro157, pro158, pro159, pro160, pro161, pro162, 
pro163, pro164, pro165, pro166, pro167, pro168, pro169, pro171, pro172, pro173, 
pro174, pro175, pro176, pro177, pro178, pro179, pro180, pr6181, pro182, pro183, 

30 pro184, pro185, pro187, pro189, pro191, pro193, pro194, pro195, pro196, pro197, 
pro198, pro199, pro201, pro202, pro203, pro204, pro205, pro206, pro207, pro208, 
pro209, pro210, pro211, pro212, pro213, pro215, pro216, pro217, pro219, pro220, 
pro221, pro222, pro223, pro224, pro225, pro226, pro227, pro228, pro229, pro230, 
pro231 , pro232, pro233, pro234, pro235, pro236, pro237, pro238, pro239, pro240, 

35 pro241, pro242, pro243, pro244, pro245, pro246, pro247, pro248, pro249, pro250, 
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pro251, pro253, pro254, pro255, pro256, pro257, pro258, pro259, pro260, pro262, 
pro263, pro264, pro267, pro268, pro269, pro270, pro271, pro272, pro273, pro275, 
pro277, pro278, pro279, pro280, pro282, pro283, pro284, pro285, pro286, pro287, 
pro289, pro290, pro291 , pro292, pro293, pro294, pro295, pro296, pro297, pro298, 
5 pro299, pro300, pro301 , pro302, pro303, pro304, pro305, pro306, pro307, pro308, 
pro309, pro310, pro311, pro312, pro313, pro315, pro316, pro317, pro318, pro319, 
pro320, pro321, pro322, pro323, pro324, pro326, pro327, pro328, pro329, pro330 t 
pro331 , pro332, pro333, pro334, pro335, pro336, pro337, pro338, pro339, pro340, 
pro341, pro344, pro346, pro347, pro348, pro349, pro352, pro353, pro354, pro355, 
1 0 pro356, pro358, pro359 and pro361 . 

The first nucleic acid sequences may furthermore comprise and/or essentially con- 
sist of fragments of nucleic acid sequences selected from the group consisting of 
prol, pro2, pro3, pro4, pro5, pro6, pro7, pro8, pro9, pro10, pro12, pro13, pro14, 

15 pro15, pro16, pro17, pro18, pro19, pro20, pro21, pro22, pro23, pro24, pro25 ( pro26, 
pro27, pro28, pro29, pro30, pro31 , pro32, pro34, pro36, pro37, pro38, pro39 f pro40, 
pro41 t pro42, pro43, pro44, pro45, pro46 t pro47, pro48, pro49, pro50, pro51 f pro52, 
pro53, pro54, pro55, pro56, pro57 f pro58, pro59, pro60, pro61, pro62, pro63, pro64, 
pro65, pro66, pro67, pro68, pro69, pro70, pro71, pro72, pro73, pro74, pro75, pro76, 

20 pro77, pro78, pro79, pro80, pro81, pro82, pro83, pro84 f pro85, pro86, pro87, pro88, 
pro89, pro90, pro91 t pro92, pro93, pro94, pro95, pro96, pro97, pro98, pro99, 
pro100, pro101, pro103, pro104, pro105, pro106, pro107, pro108, pro109, pro110, 
pro111, pro112, pro113, pro114, pro115, pro116, pro117, pro118, pro119, pro120, 
pro121, pro122, pro123, pro124, pro125, pro126, pro127, pro128, pro129, pro130, 

25 pro131, pro133 t pro134, pro135, pro136, pro137, pro138, pro139, pro140, pro141, 
pro142, pro143 t pro144, pro145, pro146, pro147, pro148, pro149, pro150, pro152, 
pro153 f pro154, pro155, pro156 f pro157, pro158, pro159, pro160, pro161, pro162, 
pro163, pro164, pro165, pro166, pro167, pro168, pro169, pro171, pro172, pro173, 
pro174, pro175, pro176, pro177, pro178 p pro179, pro180, pro181 t pro182 ? pro183, 

30 pro184, pro185, pro187, pro189, pro191, pro193, pro194, pro195, pro196, pro197 t 
pro198, pro199, pro201, pro202, pro203, pro204, pro205, pro206, pro207, pro208, 
pro209, pro210, pro211, pro212, pro213 t pro215, pro216, pro217, pro219, pro220, 
pro221, pro222, pro223, pro224, pro225, pro226, pro227, pro228, pro229, pro230, 
pro231, pro232, pro233, pro234, pro235, pro236, pro237, pro238, pro239, pro240 f 

35 pro241, pro242, pro243, pro244, pro245 t pro246, pro247, pro248, pro249, pro250, 
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pro251, pro253, pro254, pro255, pro256, pro257, pro258, pro259, pro260, pro262, 
pro263, pro264, pro267, pro268, pro269, pro270, pro271, pro272, pro273, pro275, 
pro277, pro278, pro279, pro280, pro282, pro283, pro284, pro285, pro286, pro287, 
pro289, pro290, pro291, pro292, pro293, pro294, pro295, pro296, pro297, pro298, 
5 pro299, pro300, pro301, pro302, pro303, pro304, pro305, pro306, pro307, pro308, 
pro309, pro310, pro311, pro312, pro313, pro315, pro316, pro317, pro318, pro319, 
pro320, pro321, pro322, pro323, pro324, pro326, pro327, pro328, pro329, pro330, 
pro331 , pro332, prp333, pro334, PF0335, pro336, pro337, pro338, pro339, pro340, 
pro341, pro344, pro346, pro347, PP0348, pro349, pro352, pro353, pro354, pro355, 
10 pro356, pro358, pro359 and pro361. 

Even more preferably, the first nucleic acid sequences are selected from the group 
consisting of pro221, pro210, pro71, pro41, pro30, pro2, pro209, pro14, pro4, pro8, 
pro246, pro16, pro27, pro5, pro49, pro19 f pro140, pro139, pro207, pro81, pro273 
1 5 . and pro362 and fragments thereof. 

The first nucleic acid sequence may also comprise more than one fragment of 
nucleotide sequences selected from the above-mentioned group. 

20 It is also contained within the present invention that the first nucleic acid sequence 
further comprises nucleic acid sequences not natively associated therewith. The 
nucleic acid sequences not natively associated therewith may for example be a 
transcription factor binding sites, preferably one or more steroid hormone receptor 
binding sites. 

25 

In preferred embodiments of the present invention the first nucleic acid sequences 
may be any first nucleic acid sequence as outlined herein above. 

i 

In certain embodiments, nucleic acid sequences are stably integrated into the ge- 
30 nome of the cell. This integration may be in the cognate location and orientation via 
homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In further embodiments, the nucleic acid 
sequences may be stably maintained in the cell as a separate, episomal segment of 
DNA. Such nucleic acid segments or "episomes" encode sequences sufficient to 



WO 03/000928 



PCT/IB02/03534 



116 

permit maintenance and replication independent of or in synchronisation with the 
host cell cycle. 

The targeting complex may in addition to a binding partner and a bioreactive species 
5 further comprise additional components. Additional components may for example be 
protective components. 

When the bioreactive species is a nucleic acid the targeting complex may further 
comprise a protective capping, wherein said protective capping consists of nucleic 
10 acid sequences attached to the first and/or second nucleic acid sequences. 

The nucleic acid sequences with protective properties may for example comprise a 
modified nucleotide. The modified nucleotide may for example be modified with one 
or more amino acids, amine groups or biotin groups. 

15 In one embodiment of the present invention the bioreactive species is a toxin. A 

toxin is any species which is toxic to a cell expressing the cell surface molecule. For 
example the toxin may be selected from the group consisting of ricin, diphteria toxin, 
pseudomonas exotoxin, streptozotocin or cholera toxin. However, this list of toxins is 
not complete and should not be regarded as limiting to the invention. 

20 

In another embodiment of the present invention the bioreactive species is an inducer 
of apoptosis. Any compound, which is capable of inducing apoptosis directly or 
indirectly, in a cell expressing a cell surface molecule, is an inducer of apoptosis 
within the meaning of the present invention. 

25 

An inducer of apoptosis may be a polypeptide (see herein below) or it may be any 
other kind of compound. For example the inducer of apoptosis may be selected from 
the group consisting of retinoic acid, A23187, Okadaic Acid, Puromycin, w 
Staurosporine, Thapsigargin, Actinomycin D, Camptothecin, Cycloheximide, 
30 Dexamethasone, Etoposide and Glucocorticoid. However, any other inducer of 
apoptosis is also contained within the present invention. 

In yet another embodiment of the present invention the bioreactive species is a ra- 
dioactive isotope. A radioactive isotope may be selected from the group consisting 
35 of (125)1, (131)1, (123)1, (111)ln, (205) Bi, (206)Bi, (213)Bi, (186)Re, (188)Re, 
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(225)Ac, 99mTc, (68)Ga, (62)Cu, (90)Y, (64)Cu, (211)At ? (212)Bi, (177)Lu, (153)Sm 
and (157)Gd. In one embodiment, the radioactive active species may be covalently 
linked to anoth,er species, for example the radioactive species may be covalently 
linked to a binding partner. 

5 

In a still further embodiment of the present invention the bioreactive species is a 
cytostatica. A cytostatica may for example be a drug, which can be used for 
chemotherapy. Drugs suitable for use in chemotherapy are mentioned herein below. 

10 The bioreactive species according to the present invention may be an antagonist of 
a hormone, preferably an antagonist of a hormone selected from the group 
consisting of estrogens, androgens, progesterones, LH and RH. 

Androgens can for example be selected from the group consisting of testosterone, 
15 dihydrotestosterone, androstenediol, androstenedione, dehydroepiandrosterone 
(DHEA), dehydroepiandrosterone sulfate (DHEA-S) and derivatives thereof. 

Estrogens can for example be selected from the group consisting of estrion, estra- 
diol, estriol and derivatives thereof. 

20 

Alternatively, the bioreactive species may be an aromitase inhibitor. 

In one preferred embodiment of the present invention the bioreactive species 
comprises or essentially consists of a polypeptide. In particular such a polypeptide 
25 may be a therapeutic protein. 

The term "therapeutic protein" is intended to refer to any polypeptide introduced into 
a cell for the potential benefit of the cell or to an organism comprising said cell. 
A therapeutic protein may belong to a number of different classes. For example a 
30 therapeutic protein may be a tumour suppressor, a toxic substance or it may be an 
inducer of apoptosis. The therapeutic protein according to the present invention may 
be a protein, which can contribute to a cell cycle arrest. 

In the context of cancer treatment modalities, a particularly useful gene is a tumour 
35 suppressor. During the process of transformation of normal cells to neoplastic cells, 
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the mutation of tumour suppressor genes is thought to play an important role. One 
of the most important functions of a tumour suppressor gene is to attenuate cell divi- 
sion and mediate apoptosis of mutated cells. Tumour suppressor genes are highly 
effective, so that mutation of both alleles of the tumour suppressor gene is neces- 
5 sary to obviate its function. The introduction of a functional tumour suppressor gene 
into a cancer cell with a mutated phenotype is therefore often sufficient to induce cell 
cycle arrest and apoptosis. p53, p73 and p16 are tumour suppressor genes fre- 
quently mutated in lung cancer. Introduction of a wild type version of these genes 
into cancer cells using a therapeutic gene-delivering vector to induce apoptosis is a 
10 possible way to kill cancer cells selectively. There are numerous tumour suppres- 
sors well known to those in the art, preferred examples include p53, p73, p16, Rb, 
APC, DCC, NF-1, NF-2, WT-1, MEN-I, MEN-II, BRCA1 , VHL, FCC and MCC. This 
list is not intended to be exhaustive of the various tumour suppressors known in the 
art but, rather, is exemplary of the more common tumour suppressors. 

15 

Preferably, the therapeutic protein is a tumour suppressor selected from the group 
consisting of p73, p16, Rb, APC, DCC, NF-1, NF-2, WT-1, MEN-1, MEN-II, BRCA1, 
VHL, FCC, MCC, MSH2, PTCH, DPCH, TSC2, CDKN2A and ARF. More preferably, 
the therapeutic protein is p53. 

20 

The important endpoint of therapy for cancer is the killing or elimination of cancer 
cells. One of the commonly used approaches for induction of this event is the intro- 
duction of wild type p53 into cancer cells with mutated p53, resulting in cell cycle 
arrest and induction of apoptosis. The use of p53 as a therapeutic gene is depend- 

25 ent on the status of the endogenous p53 in the cancer cell. Wild type overexpres- 
sion is often efficient, however, overexpression of p53 in combination with overex- 
pression of cell cycle regulating genes, such as p16, may enhance the effect. Other 
cell cycle regulating genes such as p15, p17, p18 or p19 may also be effective in 
combination with p53 or other genes from the p53 family, such as p73. It is also 

30 possible that combination therapy with chemotherapeutic drugs or ionising radiation 
can markedly augment the therapeutic response to p53 gene therapy. 

The. Bcl-2 family of proteins are important regulators of cell death. They are com- 
prised of two opposing factions, the proapoptotic versus the aritiapoptotic members. 
35 All bcl-2 family members share one or more of four highly conserved domains, BH1, 
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BH2 f BH3 and BH4. Bcl-2 family members include, but are not limited to, A1 , mcl-1 , 
bcl-2, bcl-w, bct-x, bax, bad and bak. A1, bcl-2, mcl-1 , bcl-w and bcl-xl (a long form 
of bcl-x) genes encode intracellular membrane proteins shown to block or delay 
apoptosis. Overexpression of these genes has been shown to confer resistance to 
5 apoptosis including that induced by chemotherapy. Antisense oligonucleotides or 
ribozymes directed against these genes and their proteins can be used therapeuti- 
cally to induce apoptosis. 

In contrast, bax, bad, bak and bcl-xs (a short form of bcl-x) are presently known to 
10 promote cell death by inhibiting the protective effects of the antiapoptotic bcl-2 family 
members. A possible method of inducing apoptosis in tumour cells is by introduction 
and overexpression of these genes. 

Caspases (cysteine-aspartic-acid-proteases) are a class of proteins central to the 
15 apoptotic program. These proteases are primarily responsible for the degradation. of 
cellular proteins that lead to the morphological changes seen in cells undergoing 
apoptosis. Caspases are present as inactive pro-enzymes that are activated by 
proteolytic cleavage. At least 12 caspases has been identified in humans. 
Caspases 8, 9 and 3 are situated at pivotal junctions in apoptosis pathways. 
20 Caspase 8 and caspase 9 activate caspase 3 by proteolytic cleavage and caspase 3 
then cleaves vital cellular proteins or other caspases. It is contemplated that the 
introduction and overexpression of one of these caspases will lead to apoptosis in 
cancer cells. 

25 Preferably, the therapeutic protein is an inducer of apoptosis selected from the 

group consisting of Fas/App1 , TNF, TRAIL, TGF-p, caspases, Bak, Bax , Bid, Bik 
and GZMB. 

The bioreactive species according to the present invention may furthermore be an 
30 antibody that bind oncogenic proteins or other proteins involved in the formation of 
cancer. A list of oncogenic protein are given herein above. 

Cancer cells often produce growth factors and growth factor receptors to sustain 
autocrine or paracrine loops that mediate proliferation, angiogenesis and evasion of 
35 the immune system. Accordingly, the bioreactive species may be an antibody, for 
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example an intracellular single chain that inhibits one or more growth factors se- 
lected from the group consisting of TGF-p, VEGF, IGF and growth factor receptors 
such as EGFR. 

5 Additionally the therapeutic protein may be a protein capable of protecting the cell 
against a toxic agent or it may be a protein which is capable of catalysing the 
synthesis of a toxic substance. 

Different systems have been developed where a protein is introduced that mediates 
10 the conversion of a prodrug to a cytotoxic compound. The herpes simplex virus thy- 
midine kinase (HSVrtk) gene converts specific protoxic nucleoside analogs such as 
acyclovir and gancyclovir into potent DNA synthesis inhibitors. Cells capable of ex- 
pressing HSV-tk are rendered extremely sensitive to the drug, while non-HSV-tk 
expressing cells are relatively insensitive. The effects of the prodrug conversion is 
15 not only seen in the HSV-tk transduced cell, but also in the surrounding cells. This 
effect is termed the bystander effect, which is a therapeutic advantage, as it avoids 
the need to transduce 100% of the tumour cells with the HS>V-tk gene. 

Another such drug susceptibility therapeutic protein is the cytosine deaminase (CD). 
20 The CD protein catalyses the conversion of the prodrug 5-fluorocytosine (5FC) to 5- 
fluorouracil (5FU); treatment of CD transduced cells with 5FC results in the conver- 
sion of the 5FC into the antitumour drug 5FU into CD-positive tumour cells. 

The therapeutic protein may furthermore be a toxic protein, such as cytokines, to be 
25 introduced to interfere with the expression of oncogenes and thus inhibit neoplastic 
cell growth. 

The targeting complex according to the present invention may comprise more than 
one different bioreactive species, such as 2, for example 3, such as 4, for example 
30 5, such as more than 5 different bioreactive species. For example the targeting 
complex may comprise more than one first nucleotide sequence encoding a 
therapeutic protein or more than one therapeutic protein, for example 2, such as 3, 
for example 4, such as 5, for example more than 5 first nucleotide sequences 
encoding a therapeutic protein and/or therapeutic proteins. 



35 
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In some embodiments of the present invention the targeting complex further 
comprises a nuclear targeting signal. The nuclear targeting signal directs 
translocation into the nucleus. Certain bioreactive species must enter the nucleus to 
be active and accordingly it is advantageous if they are attached to a nuclear 
5 localisation signal. For example DNA sequences must enter the nucleus in order to 
be transcribed. 

The nuclear targeting signal according to the present invention may be any nuclear 
targeting signal, which is capable of localising to the nucleus. The nuclear targeting 
10 signal may for example be an oligopeptide, preferably the nuclear targeting signal is 
selected from the group consisting of oligopeptide with the following sequences: 

RRMKWKK PDX (SEQ ID 310) 

RVHPYQR QKI (SEQ ID NO 311) 

KRPACTLKPECVQQLLVCSQEAKK HCDA (SEQ ID NO 31 2) 
1 5 PKKKRKV (SEQ ID NO 31 3) (SV40 LrgT) 

GKKRSKA (SEQ ID NO 314) (H2B) 

KAKRQR (SEQ ID NO 315) (v-Rel) 

RGRRRRQR (SEQ ID NO 316) 

RKRRR (SEQ ID NO 317) 
20 PPVKRERTS (SEQ ID NO 31 8) (RanBP3) 

PYLNKRKGKP (SEQ ID NO 319) (Pho4p) 

CYGSKNTGAKKRKIDDA (SEQ ID NO 320) (DNAhelicaseQI) 

KKKKRKREK (SEQ ID NO 321) (LEF-1) 

KKKRRSREK (SEQ ID NO 322) (TCF-1) 
25 Krx{7,9}PQPKKKP (SEQ ID NO 323) (p53) 

KVTKRKHDNEGSGSKRPK (SEQ ID NO 324)(Hum-Ku70) 

RLKKLKCSKx( 1 9}KTKR (SEQ ID NO 325) (GAL4) 

RKRIREDRKx(18}RKRKR TCPTP (SEQ ID NO 326) 

RRERx(4}RPRKIPR (SEQ ID NO 327) (BDV-P) 
30 KKKKKEEEGEGKKK (SE QID NO 328) (act/inn betaA) 

PRPRKIPR (SEQ ID NO 329) (BDV-P) 

PPRIYPQLPSAPT (SEQ ID NO 330) (BDV-P) 
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KKKQKK (SEQIDN0 361) 

REKKEKEQKEKCA (SEQ ID NO 362) 

LEKKVKKKFDWCA (SEQ ID NO 363) 

TEKK[QG]KSILYDCA (SEQ ID NO 364) 
5 SDKKVRSRLIECA (SEQ ID NO 365) 

LKRKLQR (SEQ ID NO 366) 

RRKGKEK (SEQ ID NO 367) 

CKRKTTNADRRKA (SEQ ID NO 368) 

VNEAFETLKRC (SEQ ID NO 369) 
1 0 MPTEERVRKRKESNRESARRSRYRKAAHLK (SEQ ID NO 370) 

KVNSRKRRKEVPGPNGATEED (SEQ ID NO 371) 

PRRGPR HCV (SEQ ID NO 372) 

PRGRRQPIPKARQP (SEQ ID NO 373) 

KRSAEGGNPPKPLKKLR (SEQ ID NO 374) 
1 5 KRKx{1 1)KKKSKK (SEQ ID NO 375) 

EYLSRKGKLEL (SEQ ID NO 376) 

PKRPRDRHDGELGGRKRARG (SEQ ID NO 377) 

KRPAATKKAGQAKKKK (SEQ ID NO 378) 

KRKKEMANKSAPEAKKKK (SEQ ID NO 379) 
20 RKRAFHGDDPFGEGPPDKK (SEQ ID NO 380) 

GGGx{3)KNRRx(6)RGGRN (SEQ ID NO 381) 

YNNQSSNFGPMKGGN (SEQ ID NO 382) 
' PAAKRVKLD (SEQ ID NO 383) 

KRPAEDMEEEQAFKRSR SxGTKRSYxxM (SEQ ID NO 384) 
25 MNKIPIKDLLNPG (SEQ ID NO 385) 

PKKARED (SEQ ID NO 386) 

VSRKRPR (SEQ ID NO 387) 

APTKRKGS (SEQ ID NO 388) 

PNKKKRK (SEQ ID NO 389) 
30 EEDGPQKKKRRL (SEQ ID NO 390) 

PLLKKIKQ (SEQ ID NO 391) 
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PPQKKIKS (SEQ ID NO 392) 
PQPKKKP (SEQ ID NO 393) 
SKRVAKRKL (SEQ ID NO 394) 
IKYFKKFPKD (SEQ ID NO 395) 
5 KTRKHRG (SEQ ID NO 396) 
KHRKHPG (SEQ ID NO 397) 
PQSRKKLR (SEQ ID NO 398) 
H RKYE APR Hx{6}PRKR (SEQ ID NO 399) 
KKEKKKSKK (SEQ ID NO 400) . 

10 

Wherein the name of the protein from which they have been derived is indicated in 
brackets and wherein '[KR]' indicates *K or R\ i.e. any of the two amino acids valid at 
that position, V for 'any amino acid\ 'xfSp'for '9 times x\ and 'x{7,9/'for 'at least 7, at 
most 9 times x\ Amino acids are given in their one-letter code. 

15 

Furthermore, nuclear localisation signal according to the present invention may also 
be mutants of the above mentioned sequences, such as mutants wherein 1 , such as 
2, for example 3, such as 4, for example 5, such as 6, for example 7, such as 8, for 
example 9, such as 10 amino acids have been substituted for any another amino 
20 acid, preferably it is a conservative amino acid substitution (see herein above). 

Mutants wherein 1 , such as 2, for example 3, such as 4, for example 5, such as 6, 
for example 7, such as 8, for example 9, such as 10 amino acids have been deleted 
are nuclear localisation signal according to the present invention. 

25 More preferably, the nuclear targeting signal is the nuclear localisation signal of 
simian virus 40 large tumour antigen. 

In certain embodiments of the present invention the targeting complex further 
comprises a endosomal lytic agent. The targeting complex is frequently taken up 
30 into cells expressing the cell surface molecule by a process known as receptor 
mediated endocytosis and accordingly the targeting complex enters the cell in an 
endosome, which it has to escape in order to avoid degradation. Hence, the 
targeting complex often comprise an endosomal lytic agent. 
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KDCVINKHHRNRCQYCRLQR (SEQ ID NO 331) (TR2) 

Krx{9)KTKK (SEQ ID NO 332)(THOV NP) 

APKRKSGVSKC (SEQ ID NO 333) (PolyomaVPI) 

RKKRRQRRR (SEQ ID NO 334) (HIV-1 Tat) 
5 RQARRNRRRRWR (SEQ ID NO 335) (HIV-1 Rev) 

MPKTRRRPRRSQRKRPPT (SEQ ID NO 336) (Rex) 

KRPM N AFI VWSRDQRRK (SEQ ID NO 337) (SRY) 

PRRRK (SEQ ID NO 338) (SRY) 

KRPMNAFMVWAQAARRK (SEQ ID NO 339) (SOX9) 
1 0 PRRRK (SEQ ID NO 338) (SOX9) 

[KAR]TPIQKHWRPTVLTEGPPVKIRIETGEWE[KA] (SEQ ID NO 340) 

PPRKKRTW (SEQ ID NO 341) 

YKRPCKRSFIRFI (SEQ ID NO 342) 
. LKDVRKRKLGPGH (SEQ ID NO 343) 
15 KRPRP (SEQ ID NO 344) 

RKRKK (SEQ ID NO 345) 

RRSMKRK hVDR (SEQ ID NO 346) 

PAKRARRGYK (SEQ ID NO 347) 

RKCLQAGM N LEARKTKK (SEQ ID NO 348) 
20 RRERNKMAAAKCRNRRR (SEQ ID NO 349) 

KRMRNRIAASKCRKRKL (SEQ ID NO 350) 

KKSKKGRQEALERLKKA (SEQ ID NO 351) 

RKEWLTNFMEDRRQRKL (SEQ ID NO 352) 

KKQTTLAFKPIKKGKKR (SEQ ID NO 353) 
25 RKRKKMPASQRSKRRKT (SEQ ID NO 354) 

RAIKRRPGLDFDDDGEGNSKFLR (SEQ ID NO 355) 

SxGTKRSYxxM (SEQ ID NO 356) 

TKRSxxxM (SEQ ID NO 357) 

RIRKKLR (SEQ ID NO 358) 
30 KRAAEDDEDDDVDTKKQK (SEQ ID NO 359) 

GRKRKKRT (SEQ ID NO 360) 
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Many viruses have developed strategies to escape the endosome and accordingly 
an attenuated virus or parts of a virus may be useful endosomal lytic agents. 
Preferably, the endosomal lytic agent is selected from the group consisting of 
polyethylenimine (PEI), a replication defective virus and a viral protein capside. 
5 More preferably, the endosomal lytic agent may comprise a membrane destabilising 
polypeptide. 

In one embodiment of the present invention the targeting complex further comprises 
chloroquine. Chloroquine may protect against endosomal degradation and its 
10 presence is accordingly desirable in some embodiments of the invention. 

In preferred embodiments of the present invention the bioreactive species and the 
binding partner associates with one another either directly or indirectly. If the 
bioreactive species is a nucleic acid sequence, the binding partner may for example 
15 associate with the bioreactive species via a nucleic acid binding agent covalently 
attached to said binding partner. 

Nucleic acid-binding agents include proteins, polypeptides, peptides, antibodies, 
nucleotides, carbohydrates, fatty acids, organic or inorganic compounds as well as a 
20 combination of these and others. 

Nucleic acid-binding agents may bind to single-stranded or double-stranded DNA, to 
single-stranded or double stranded RNA, by chemical or physical forces or by a 
combination of the two. A nucleic acid-binding agent may (i) have affinity only for the 
25 nucleic acid itself, (ii) have affinity for both the nucleic acid and another molecule, 
thereby forming a bridge between the two or (iii) have indirect affinity for the nucleic 
acid via affinity for another molecule that has affinity for the nucleic acid. 

According to the present invention, the coupling of a nucleic acid-binding agent and 
30 the binding partner must occur in a manner that does not interfere with the binding 
of the binding partner with the cell surface molecule. Preferably, internalisation of 
the targeting complex via receptor-mediated endocytosis is also retained. In an even 
more preferred embodiment, this recognition and internalisation delivers the nucleic 
acid sequences into a target cell in a form suitable for the expression or for interac- 
35 tion with target endogenous nucleic acid. 
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In one embodiment, the nucleic acid-binding agent may insert itself between base 
pairs of double-stranded nucleic acids in an intercalate manner or bind in the minor 
or major groves of double-stranded nucleic acids. 

5 

This binding may be sequence-specific or completely unrelated to sequence. In 
other embodiments, nucleic acids may be cross-linked with other molecules with 
chemically or photochemically reactive groups. 

1 0 In another embodiment of the invention, the nucleic acid-binding agent covalently 
links the nucleic acid to another molecule. In one embodiment, the nucleic acid 
binding agent is one of the coupling agents, such as carbodiimide. However, cova- 
lent coupling of the nucleic acid may alter its specificity and preclude proper gene 
expression or target nucleic acid recognition. Furthermore, linear or single stranded 

15 nucleic acid may be a requirement for covalent coupling of the nucleic acid to the 
binding partner. Finally, nucleic acids are negatively charged molecules which 
means that they may be repelled from cell surfaces, making transfer difficult via the 
endosomal lysis pathway. Therefore, a size and type restriction may be necessary 
for the efficient delivery of nucleic acid directly bound to binding partner. 

20 

An example of a nucleic acid-binding agent, is a polycationic agent that depends on 
electrostatic-dominated binding involving sequence-neutral interactions between the 
cationic groups and the negatively charged phosphates on nucleic acid similar to the 
DNA-binding agent described in WO 96/30536. 

25 

The polycationic agent binds DNA strongly resulting in the formation of a toroid 
complex where the negative charge of nucleic acid molecule is completely neutral- 
ised. This soluble toroid complex may be internalised via normal receptor-mediated 
endocytosis. 

30 

Any type of nucleic acid may be used, from single stranded mRNA to double 
stranded circular plasmids. 

Furthermore, any size. of nucleic acid may be used, as long as there is a source of 
35 negative charge for the polycationic agent to bind. In certain embodiments, these 
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polycationic moieties may include a natural polyamine such as spermine and/or 
spermidine. In a preferred embodiment, the polycationic agent may be an artificially 
produced agent, such as polylysine or polyethyleneimine. 

5 In order for the invention to function properly, certain criteria with regard to the nu- 
cleic acid-binding agent need to be fulfilled. First, the nucleic acid to be delivered 
into the cell must bind to the nucleic acid binding agent without loosing its integrity in 
anyway. 

10 Secondly, the complex comprising of ligand, nucleic acid binding agent and nucleic 
acid must be in soluble form to allow greater accessibility of the complex to cells in 
vitro and in vivo. Thirdly, once the complex is internalised within the host cell, the 
nucleic acid must have access to its target sequence while avoiding degradation. 

15 The nucleic acid binding agent may include agents such as carbodiimides, N- 

succinimidyl,3 (2-pyridyldithio) propionate, succinimmidyl,4-(N-maleimidomethyl) 
cyclohexane-l-carboxylate, diisocyanates, glutaraldehyde, diazobenzenes, and hex- 
amethylene diamines. This list is not intended to be exhaustive of the various cou- 
pling agents known in the art but, rather, is exemplary of the more common linking 

20 agents that may be used. 

Preferably, the nucleic acid binding agent is selected from the group consisting of 
poly-L-lysine (PLL), spermine, spermidine and histone proteins. 

25 When the nucleic acid binding agent is PLL, PLL may be comprising from 15 to 

1000, such as from 50 to 750, for example from 100 to 500, such as from 200 to 400 
residues. 

In one embodiment of the present invention the binding partner associates with the 
30 bioreactive species indirectly via a pair of specific interacting components wherein 
one component is covalently attached to the bioreactive species and the second 
component is covalently attached to the binding partner. 

One example of such a pair of specific interacting components is biotin and 
35 streptavidin, however other pairs of interacting components may also be used. 
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Complex comprising cell surface molecule and targeting complex 

It is one objective of the present invention to provide complexes that comprise a cell 
5 surface molecule, a binding partner and a bioreactive species. Example of cell sur- 
faces molecules, binding partner and bioreactive species are given herein above. 

Preferably, the complex may comprise a cell surface molecule identified according 
to any of the methods according to the present invention and a targeting complex as 
10 described herein above. 

Alternatively, the complex may comprise a cell surface molecule and a targeting 
complex as described herein above, wherein said cell surface molecule preferably 
comprises or essentially consists of or for example is GRIA2, such as LPR8, for 

15 example is CHRNA5, such as TMEFF, for example is NPTXR, such as Transferrin 
receptor; such as type II membrane protein clone: for example is HP10481; such as 
type II membrane protein clone: such as HP10390; for example is PG40; such as 
TRC8 ; for example is TR2-1 1; such as OA3 antigenic surface determinant; for 
example is integrin alpha 6, For example GPIIb; such as vitronectin receptor alpha 

20 subunit; for example is integrin alpha-7; such as integrin alpha E precursor; for 
example is integrin alpha 6B; such as integrin alpha 5 subunit; for example is 
integrin beta-5 subunit; such as integrin alpha-3 chain; for example is RYK; such as 
amyloid precursor protein-binding protein 1 ; for example is putative transmembrane 
GTPase; such as membrane cofactor protein; FOR EXAMPLE GLVR1; for example 

25 is Mr 1 1 0,000 antigen; for example is syndecan-1 ; such as putative seven 

transmembrane domain protein; for example is LCA-homolog/ LAR protein; such as 
M6 antigen; for example is Me491/CD63 antigen; such as multispanning membrane 
protein; for example is DDR; such as autocrine motility factor receptor; for example 
is insulin receptor precursor; such as IGF1R, for example is insulin-like growth factor 

30 II receptor; such as SAS; for example is TAPA-1; such as MICB; for example is 
MHC class II HLA-DR7-associated glycoprotein beta-chain; such as H LA-DP; for 
example is bone small proteoglycan I biglycan; such as CAR; for example is 
MEA1 1 ; such as interferon-gamma receptor alpha chain; for example is Polymeric 
immunoglobulin receptor; such as metabotropic glutamate receptor type 4; for 
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example is metabotropic glutamate receptor 8; such as CLPTM1 ; for example is 
MAGE-4b; such as MAGE5a; for example is MAGE-3; such as MAGE-1; for 
example is MAGE6; such as MAGE-9; for example is MAGE11; such as CD24; for 
example is CD59; such as CD44; for example is low density lipoprotein receptor; 

5 such as very low density lipoprotein receptor; for example is N-CAM; such as lamin 
B receptor homolog TM7SF2; for example is putative T1/ST2 receptor binding 
protein precursor; such as NTR2 receptor; for example is RAGE-4; such as HLA-G1; 
for example is MOAT-C; such as alpha 2 delta calcium channel subunit isoform I; for 
example is LFA-3; such as L1-CAM; for example is AVPR2; such as C1 p115 C1; 

1.0 for example is TE2; such as RbP; for example is HCF1 ; such as IRAK; for example 
is CD151; such as surface antigen; for example is MAG; such as GPR19; for 
example is pcta-1 ; such as PRAME; for example is vasopressin activated calcium 
mobilizing receptor-like protein; such as serotonin receptor 5-HT4B; for example is 
serotonin 1 D receptor (5-HT1 D~); such as CD9; for example is LDL receptor 

15 member LR3; such as DR6; for example is tumor necrosis factor receptor; such as 
HG38; for example is urokinase-type plasminogen receptor; such as FGF receptor; 
for example is nerve growth factor receptor; such as cystine/glutamate transporter; 
for example is CB1 cannabinoid receptor (CNR1); such as PSG; for example is 
PSG13'; such as CPE-receptor; for example is CRH2R; such as OCI5; for example 

20 is TRAIL receptor 2; such as HNMP-1 ; for example is kidney alpha-2-adrenergic 
receptor; such as erythropoietin receptor; for example is chondroitin sulphate 
proteoglycan versican V1; for example is niGluRlbeta; such as CD97; for example 
is L6; such as NY-ESO-1 ; for example is T-cell receptor alpha delta; such as rorl ; 
for example is ror2; such as SSTR2; for example is VESPR; such as IgG Fc 

25 receptor; for example is glutamate receptor subunit GluRC; such as HEK2; for 
example is PVR; such as CEA; for example is CC-chemokine-binding receptor 
JAB61; such as HER2; for example is HER3; such as hypothetical protein FLJ22357 
similar to Epidermal growth factor receptor-related protein; for example is putative 
endothelin receptor type B-like protein; such as GLVR2; for example is P2X4 

30 purinoreceptor; such as FPRL1 ; for example is Atrial natriuretic peptide clearance 
receptor; for example is gastrin/CCK-B receptor, such as Neuromedin B receptor; 
for example is GFRA3; such as GRPR; for example is CDH1 ; such as CDH2; for 
example is TGFBR1 ; such as TGFBR2; for example is TGFBR3; such as precursor 
of epidermal growth factor receptor. 



35 
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More preferably, the cell surface molecule may be selected from the group 
consisting of NCAM1 , NPTXR, LRP8, CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, 
TNFRSF12, L1CAM, GPR49 and TMEFF1. 

5 Conditions 

A premalignant and/or malignant conditions may for example be cancer or a 
conditions which may develop into a cancer. The term cancer within the scope of the 
present invention covers both malignant and benign tumours, as well as leukaemia. 

10 

Cancer may for example be adenomas, carcinomas or sarcomas. Cancer may for 
example be selected from the group consisting of melanoma, brain tumours, neuro- 
blastomas, breast cancer, lung cancer,, prostate cancer, cervix cancer, uterine can- 
cer, ovarian cancer, leukaemia, colon cancer, rectum cancer, cancer of the testis, 
1 5 cancer of the kidney, cancer of the liver, cancer of the lip, cancer of the tongue, can- 
cer of the stomach, skin cancer, sarcomas, mesotheliomas, bladder cancer, bone 
tumours, malignant pleural effusions, ascites, meningeal carcinomatosis, head and 
neck cancers and cancers of endocrine organs such as: thyroid gland, pituitary 
gland and suprarenal gland. 

20 

Lung cancer may for example be cancers selected from the group comprising small 
cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). Preferably, the 
premalignant and/or malignant conditions is small cell lung cancer. 

25 In one preferred embodiment the premalignant and/or malignant conditions is breast 
cancer. 

In another preferred embodiment the premalignant and/or malignant conditions is a 
brain tumour. Brain tumours may for example be selected from the group comprising 
30 glioblastomas, neuroblastomas, astrocytomas, oligodendrogliomas, meningiomas, 
medulloblastomas, neuronomas, ependymomas, craniopharingiomas, pineal 
tumours, germ cell tumours and schwannomas. 
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Administration and pharmaceutical compositions 

The individual to receive treatment is any animal, however, preferably the individual 
is a human being. 

5 The treatment according to the present invention may be ameliorating treatment, it 
may be curative treatment and/or it may be prophylactic treatment. 

The main routes of drug delivery according to the present invention are intravenous, 
oral and subcutaneous, as will be described below. Other drug-administration meth- 
10 ods, such as topical delivery, which are effective to deliver the drug to a target site 
or to introduce the drug into the bloodstream, are also contemplated. The com- 
pounds may also be administered by inhalation, that is by intranasal and oral inhala- 
tion administration. 

15 The mucosal membrane to which the pharmaceutical preparation of the invention is 
administered may be any mucosal membrane of the mammal to which the biologi- 
cally active substance is to be given, e.g. in the nose, vagina, eye, mouth, genital 
tract, lungs, gastrointestinal tract, or rectum. 

20 Compounds of the invention may preferably be administered parenterally, that is by 
intravenous, intramuscular, subcutaneous intranasal, intrarectal, intravaginal or in- 
traperitoneal administration. The subcutaneous and intramuscular forms of paren- 
teral administration are generally preferred. Appropriate dosage forms for such ad- 
ministration may be prepared by conventional techniques. 

25 

Preferably, the targeting complex according to the present invention is administrated 
parenterally, more preferably the targeting complex is administrated by intravenous 
injection and/or by subcutaneous injection. 

30 The compounds according to the invention may be administered with at least one 
other compound. The compounds may be administered simultaneously, either as 
separate formulations or combined in a unit dosage form, or administered sequen- 
tially. 
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The dosage requirements will vary with the particular drug composition employed, 
the route of administration and the particular individual being treated. Ideally, an 
individual to be treated by the present method will receive a pharmaceutical^ effec- 
tive amount of the compound in the maximum tolerated dose, generally no higher 
5 than that required before drug resistance develops. 

The individual dosages of a targeting complex will be determined by the nature and 
extent of the condition being treated, the form, route and site of administration, and 
the particular patient being treated, and that such optimums can be determined by 
10 conventional techniques. It will also be appreciated by one of skill in the art. that the 
optimal course of treatment, i.e., the number of doses of a compound or a pharma- 
ceutically acceptable salt thereof given per day for a defined number of days, can be 
ascertained by. those skilled in the art using conventional course of treatment deter- 
mination tests. 

15 

The term "unit dosage form" as used herein refers to physically discrete units suit- 
able as unitary dosages for human and animal individuals, each unit containing a 
predetermined quantity of a compound, alone or in combination with other agents, 
calculated in an amount sufficient to produce the desired effect in association with a 

20 pharmaceutical^ acceptable diluent, carrier, or vehicle. The specifications for the 
unit dosage forms of the present invention depend on the particular compound or 
compounds employed and the effect to be achieved, as well as the pharmacody- 
namics associated with each compound in the host. The dose administered should 
be an " effective amount" or an amount necessary to achieve an "effective level" in 

25 the individual patient. 

Since the "effective level" is used as the preferred endpoint for dosing, the actual 
dose and schedule can vary, depending on interindividual differences in pharmaco- 
kinetics, drug distribution, and metabolism. The "effective level" can be defined, for 
30 example, as the blood or tissue level desired in the individual that corresponds to a 
concentration of one or more compounds according to the invention. 

Pharmaceutical compositions containing a compound of the present invention may 
be prepared by conventional techniques, e.g. as described in Remington: The Sci- 
35 ence and Practice of Pharmacy 1995, edited by E. W. Martin, Mack Publishing 
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Company, 19th edition, Easton, Pa. The compositions may appear in conventional 
forms, for example capsules, tablets, aerosols, solutions, suspensions or topical 
applications. 

5 Pharmaceutical acceptable salts of the compounds according to the present inven- 
tion should also be considered to fall within the scope of the present invention. 
Pharmaceutical^ acceptable salts are prepared in a standard manner. If the parent 
compound is a base it is treated with an excess of an organic or inorganic acid in a 
suitable solvent. If the parent compound is an acid, it is treated with an inorganic or 
10 organic base in a suitable solvent. 

The compounds of the invention may be administered in the form of an alkali metal 
or earth alkali metal salt thereof, concurrently, simultaneously, or together with a 
pharmaceutical^ acceptable carrier or diluent, especially and preferably in the form 
15 of a pharmaceutical composition thereof, whether by oral, rectal, or parenteral (in- 
cluding subcutaneous) route, in an effective amount. 

Examples of pharmaceutical^ acceptable acid addition salts for use in the present 
inventive pharmaceutical composition include those derived from mineral acids, 
20 such as hydrochloric, hydrobromic, phosphoric, metaphosphoric, nitric and sulfuric 
acids, and organic acids, such as tartaric, acetic, citric, malic, lactic, fumaric, ben- 
zoic, glycolic, gluconic, succinic, p-toluenesulphonic acids, and arylsulphonic, for 
example. 

25 Whilst it is possible for the compounds or salts of the present invention to be ad- 
ministered as the raw chemical, it is preferred to present them in the form of a 
pharmaceutical formulation. Accordingly, the present invention further provides a 
pharmaceutical formulation, for medicinal application, which comprises a compound 
of the present invention or a pharmaceutical^ acceptable salt thereof, as herein 

30 defined, and a pharmaceutical^ acceptable carrier therefor. 

The compounds of the present invention may be formulated in a wide variety of oral 
administration dosage forms. The pharmaceutical compositions and dosage forms 
may comprise the compounds of the invention or its pharmaceutical^ acceptable 
35 • salt or a crystal form thereof as the active component. The pharmaceutical^ accept- 
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able carriers can be either solid or liquid. Solid form preparations include powders, 
tablets, pills, capsules, cachets, suppositories, and dispersible granules. A solid car- 
rier can be one or more substances which may also act as diluents, flavouring 
agents, solubilisers, lubricants, suspending agents; binders, preservatives, wetting 
5 agents, tablet disintegrating agents, or an encapsulating material. 

Preferably, the composition will be about 0.5% to 75% by weight of a compound or 
compounds of the invention, with the remainder consisting of suitable pharmaceuti- 
cal excipients. For oral administration, such excipients include pharmaceutical 
10 grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, tal- 
cum, cellulose, glucose, gelatin, sucrose, magnesium carbonate, and the like. 

In powders, the carrier is a finely divided solid which is a mixture with the finely di- 
vided active component. In tablets, the active component is mixed with the carrier 

15 having the necessary binding capacity in suitable proportions and compacted in the 
shape and size desired. The powders and tablets preferably containing from one to 
about seventy percent of the active compound. Suitable carriers are magnesium 
carbonate, magnesium stearate, talc, sugar, lactose, pectin, dextrin, starch, gelatin, 
tragacanth, methylcellulose, sodium carboxymethylcellulose, a low melting wax, 

20 cocoa butter, and the like. The term "preparation" is intended to include the formula- 
tion of the active compound with encapsulating material as carrier providing a cap- 
sule in which the active component, with or without carriers, is surrounded by a car- 
rier, which is in association with it. Similarly, cachets and lozenges are included. 
Tablets, powders, capsules, pills, cachets, and lozenges can be as solid forms suit- 

25 able for oral administration. 

Drops according to the present invention may comprise sterile or non-sterile aque- 
ous or oil solutions or suspensions, and may be prepared by dissolving the active 
ingredient in a suitable aqueous solution, optionally including a bactericidal and/or 

30 fungicidal agent and/or any other suitable preservative, and optionally including a 
surface active agent. The resulting solution may then be clarified by filtration, trans- 
. ferred to a suitable container which is then sealed and sterilized by autoclaving or 
maintaining at 98-100°C for half an hour. Alternatively, the solution may be sterilised 
by filtration and transferred to the container aseptically. Examples of bactericidal and 

35 fungicidal agents suitable for inclusion in the drops are phenylmercuric nitrate or 
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acetate (0.002%), benzalkonium chloride (0.01%) and chlorhexidine acetate 
(0.01%). Suitable solvents for the preparation of an oily solution include glycerol, 
diluted alcohol and propylene glycol. 

5 Also included are solid form preparations, which are intended to be converted, 
shortly before use, to liquid form preparations for oral administration. Such liquid 
forms include solutions, suspensions, and emulsions. These preparations may con- 
tain, in addition to the active component, colorants, flavours, stabilisers, buffers, 
artificial and natural sweeteners, dispersants, thickeners, solubilising agents, and 

10 the like. 

Other forms suitable for oral administration include liquid form preparations including 
emulsions, syrups, elixirs, aqueous solutions, aqueous suspensions, toothpaste, gel 
dentrifrice, chewing gum, or solid form preparations which are intended to be con- 

15 verted shortly before use to liquid form preparations. Emulsions may be prepared in 
solutions in aqueous propylene glycol solutions or may contain emulsifying agents 
such as lecithin, sorbitan monooleate, or acacia. Aqueous solutions can be prepared 
by dissolving the active component in water and adding suitable colorants, flavours, 
stabilising and thickening agents. Aqueous suspensions can be prepared by dis- 

20 persing the finely divided active component in water with viscous material, such as 
natural or synthetic gums, resins, methylcellulose, sodium carboxymethylcellulose, 
and other well known suspending agents. Solid form preparations include solutions, 
suspensions, and emulsions, and may contain, in addition to the active component, 
colorants, flavours, stabilisers, buffers, artificial and natural sweeteners, dispersants, 

25 thickeners, solubilising agents, and the like. 

The compounds of the present invention may be formulated for parenteral admini- 
stration (e.g., by injection, for example bolus injection or continuous infusion) and 
may be presented in unit dose form in ampoules, pre-filled syringes, small volume 

30 infusion or in multi-dose containers with an added preservative. The compositions 
may take such forms as suspensions, solutions, or emulsions in oily or aqueous 
vehicles, for example solutions in aqueous polyethylene glycol. Examples of oily or 
nonaqueous carriers, diluents, solvents or vehicles include propylene -glycol, poly- 
ethylene glycol, vegetable oils (e.g., olive oil), and injectable organic esters (e.g., 

35 ethyl oleate), and may contain formulatory agents such as preserving, wetting, 
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emulsifying or suspending, stabilising and/or dispersing agents. Alternatively, the 
active ingredient may be in powder form, obtained by aseptic isolation of sterile solid 
or by lyophilisation from solution for constitution before use with a suitable vehicle, 
e.g., sterile,. pyrogen-free water. 

5 

Oils useful in parenteral formulations include petroleum, animal, vegetable, or syn- 
thetic oils. Specific examples of oils useful in such formulations include peanut, soy- 
bean, sesame, cottonseed, corn, olive, petrolatum, and mineral. Suitable fatty acids 
for use in parenteral formulations include oleic acid, stearic acid, and isostearic acid. 
10 Ethyl oleate and isopropyl myristate are examples of suitable fatty acid esters. 

Suitable soaps for use in parenteral formulations include fatty alkali metal, ammo- 
nium, and triethanolamine salts, and suitable detergents include (a) cationic deter- 
gents such as, for example, dimethyl dialkyl ammonium halides, and alkyl pyridinium 

15 halides; (b) anionic detergents such as, for example, alkyl, aryl, and olefin sul- 
fonates, alkyl, olefin, ether, and monoglyceride sulfates, and sulfosuccinates, (c) 
nonionic detergents such as, for example, fatty amine oxides, fatty acid alkano- 
lamides, and polyoxyethylenepolypropylene copolymers, (d) amphoteric detergents 
such as, for example, alkyl-.beta.-aminopropionates, and 2-alkyl-imidazoline quater- 

20 nary ammonium salts, and (e) mixtures thereof. 

The parenteral formulations typically will contain from about 0.5 to about 25% by 
weight of the active ingredient in solution. Preservatives and buffers may be used. In 
order to minimise or eliminate irritation at the site of injection, such compositions 

25 may contain one or more nonionic surfactants having a hydrophile-lipophile balance 
(HLB) of from about 12 to about 17. The quantity of surfactant in such formulations 
will typically range from about 5 to about 15% by weight. Suitable surfactants in- 
clude polyethylene sorbitan fatty acid esters, such as sorbitan monooleate and the 
high molecular weight adducts of ethylene oxide with a hydrophobic base, formed by 

30 the condensation of propylene oxide with propylene glycol. The parenteral formula- 
tions can be presented in unit-dose or multi-dose sealed containers, such as am- 
poules and vials, and can be stored in a freeze-dried (lyophilized) condition requiring 
only the addition of the sterile liquid excipient, for example, water, for injections, im- 
mediately prior to use. Extemporaneous injection solutions and suspensions can be 
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prepared from sterile powders, granules, and tablets of the kind previously de- 
scribed. 

The compounds of the invention can also be delivered topically. Regions for topical 
5 administration include the skin surface and also mucous membrane tissues of the 
vagina, rectum, nose, mouth, and throat. Compositions for topical administration via 
the skin and mucous membranes should not give rise to signs of irritation, such as 
swelling or redness. 

10 The topical composition may include a pharmaceutical^ acceptable carrier adapted 
for topical administration. Thus, the composition may take the form of a suspension, 
solution, ointment, lotion, sexual lubricant, cream, foam, aerosol, spray, suppository, 
implant, inhalant, tablet, capsule, dry powder, syrup, balm or lozenge, for example. 
Methods for preparing such compositions are well known in the pharmaceutical in- 

15 dustry. 

The compounds of the present invention may be formulated for topical administra- 
tion to the epidermis as ointments, creams or lotions, or as a transdermal patch. 
Creams, ointments or pastes according to the present invention are semi-solid for- 

20 mulations of the active ingredient for external application. They may be made by 
mixing the active ingredient in finely-divided or powdered form, alone or in solution 
or suspension in an aqueous or non-aqueous fluid, with the aid of suitable machin- 
ery, with a greasy or non-greasy base. The base may comprise hydrocarbons such 
as hard, soft or liquid paraffin, glycerol, beeswax, a metallic soap; a mucilage; an oil 

25 of natural origin such as almond, corn, arachis, castor or olive oil; wool fat or its de- 
rivatives or a fatty acid such as steric or oleic acid together with an alcohol such as 
propylene glycol or a macrogel. The formulation may incorporate any suitable sur- 
face active agent such as an anionic, cationic or non-ionic surfactant such as a sor- 
bitan ester or a polyoxyethylene derivative thereof. Suspending agents such as 

30 natural gums, cellulose derivatives or inorganic materials such as silicaceous silicas, 
and other ingredients such as lanolin, may also be included. 

Lotions according to the present invention include those suitable for application to 
the skin or eye. An eye lotion may comprise a sterile aqueous solution optionally 
35 containing a bactericide and may be prepared by methods similar to those for the 
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preparation of drops. Lotions or liniments for application to the skin may also include 
an agent to hasten drying and to cool the skin, such as. an alcohol or acetone, 
and/or a moisturiser such as glycerol or an oil such as castor oil or arachis oil. 

5 The pharmaceutical active compound described herein can be administered trans- 
dermal^. Transdermal administration typically involves the delivery of a pharmaceu- 
tical agent for percutaneous passage of the drug into the systemic circulation of the 
patient. The skin sites include anatomic regions for transdermal^ administering the 
drug and include the forearm, abdomen, chest, back, buttock, mastoidal area, and 
10 the like. 

Transdermal delivery is accomplished by exposing a source of the active compound 
to a patient's skin for an extended period of time. Transdermal patches have the 
added advantage of providing controlled delivery of a pharmaceutical agent- 

15 chemical modifier complex to the body. See Transdermal Drug Delivery: Develop- 
mental Issues and Research Initiatives, Hadgraft and Guy (eds.), Marcel Dekker, 
Inc., (1989); Controlled Drug Delivery: Fundamentals and Applications, Robinson 
and Lee (eds.), Marcel Dekker Inc., (1987); and Transdermal Delivery of Drugs, 
Vols. 1-3, Kydonieus and Berner (eds.), CRC Press, (1987). Such dosage forms can 

20 be made by dissolving, dispersing, or otherwise incorporating the pharmaceutical 
active compound in a proper medium, such as an elastomeric matrix material. Ab- 
sorption enhancers can also be used to increase the flux of the compound across 
the skin. The rate of such flux can be controlled by either providing a rate-controlling 
membrane or dispersing the compound in a polymer matrix or gel. 

25 

The compounds of the present invention may be formulated for administration as 
suppositories. A low melting wax, such as a mixture of fatty acid glycerides or cocoa 
butter is first melted and the active component is dispersed homogeneously, for ex- 
ample, by stirring. The molten homogeneous mixture is then poured into convenient 
30 sized molds, allowed to cool, and to solidify. 



35 



The active compound may be formulated into a suppository comprising, for exam- 
ple, about 0.5% to about 50% of a compound of the invention, disposed in a poly- 
ethylene glycol (PEG) carrier (e.g., PEG 1000 [96%] and PEG 4000 [4%]. 
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The compounds of the present invention may be formulated for vaginal administra- 
tion. Pessaries, tampons, creams, gels, pastes, foams or sprays containing in addi- 
tion to the active ingredient such carriers as are known in the art to be appropriate. 

5 When desired, formulations can be prepared with enteric coatings adapted for sus- 
tained or controlled release administration of the active ingredient. 

Pharmaceutical compositions usually comprise a carrier. Illustrative solid carrier 
include lactose, terra alba, sucrose, talc, gelatin, agar, pectin, acacia, magnesium 

10 stearate, stearic acid and the like. A solid carrier can include one or more sub- 
stances which may also act as flavoring agents, lubricants, solubiHzers, suspending 
agents, fillers, glidants, compression aids, binders or tablet-disintegrating agents; it 
can also be an encapsulating material. In powders, the carrier is a finely divided 
solid which is in admixture with the finely divided active ingredient. In tablets, the 

15 active ingredient is mixed with a carrier having the necessary compression proper- 
ties in suitable proportions, and compacted in the shape and size desired. The pow- 
ders and tablets preferably contain up to 99% of the active ingredient. Suitable solid 
carriers include, for example, calcium phosphate, magnesium stearate, talc, sugars, 
lactose, dextrin, starch, gelatin, cellulose, methyl cellulose, sodium carboxymethyl 

20 cellulose, polyvinylpyrrolidine, low melting waxes and ion exchange resins. 

Illustrative liquid carriers include syrup, peanut oil, olive oil, water, etc. Liquid carri- 
ers are used in preparing solutions, suspensions, emulsions, syrups, elixirs and 
pressurized compositions. The active ingredient can be dissolved or suspended in a 

25 pharmaceutical^ acceptable liquid carrier such as water, an organic solvent, a mix- 
ture of both or pharmaceutical^ acceptable oils or fats. The liquid carrier can contain 
other suitable pharmaceutical additives such as solubilisers, emulsifiers, buffers, 
preservatives, sweeteners, flavouring agents, suspending agents, thickening agents, 
colours, viscosity regulators, stabilisers or osmo-regulators. Suitable examples of 

30 liquid carriers for oral and parenteral administration include water (partially contain- 
ing additives as above, e.g. cellulose derivatives, preferably sodium carboxymethyl 
cellulose solution), alcohols (including monohydric alcohols and polyhydric alcohols, 
e.g. glycols) and their derivatives, and oils (e.g. fractionated coconut oil and arachis 
oil). For parenteral administration, the carrier can also be an oily ester such as ethyl 

35 oleate and isopropyl myristate. Sterile liquid carders are useful in sterile liquid form 
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compositions for parenteral administration. The liquid carrier for pressurised compo- 
sitions can be halogenated hydrocarbon or other pharmaceutical^ acceptable pro- 
pellant. Liquid pharmaceutical compositions which are sterile solutions or suspen- 
sions can be utilised by, for example, intramuscular, intraperitoneal or subcutaneous 
5 injection. Sterile solutions can also be administered intravenously. The compound 
can also be administered orally either in liquid or solid composition form. 

The carrier or excipient may include time delay material well known to the art, such 
as glyceryl monostearate or glyceryl distearate along or with a wax, ethylcellulose, 
10 hydroxypropylmethylcellulose, methylmethacrylate and the like. When formulated for 
oral administration, 0.01% Tween 80 in PHOSAL PG-50 (phospholipid concentrate 
with 1 ,2-propylene glycol, A. Nattermann & Cie. GmbH) has been recognised as 
providing an acceptable oral formulation for other compounds, and may be adapted 
to formulations for various compounds of this invention. 

15 

Combination therapies 

The targeting complex according to the present invention may be administrated i 
. combination with one or more second treatments, for example treatments which are 
20 currently used to treat cancer. 

For example such second treatments may be selected from the group consisting of 
surgical treatment, chemotherapy, radiation therapy, therapy with cytokines, 
Hormone therapy, gene therapy, immunotherapy and treatments using laser light. 

25 

Chemotherapy comprise administration of a chemotherapeutical agent, such as a 
cytostatica. Cytostatica according to the present invention may for example be se- 
lected from the group consisting of carboplatin, cisplatin, cyclophosphamide, iphos- 
phamide, hexamethylmelamine, doxorubicin, epirubicin, etopiside (VP-16), tenipo- 
30 side (VM-26), vincristine, vindecine, taxans, irinotecan, tyrosin kinase inhibitors, ni- 
mustine, lorriustine, BCNU, farnesyl transferase inhibitors, anti angiogenestic com- 
pounds, anti metastatic compounds, 5-fluoruracil ± leucovorin, topoisomerase in- 
hibitor I and II and Temozolamide. 
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In addition, chemotherapy may for example comprise administration of Anti- 
estrogen, Anti-progesteron, anti-androgen, LH-RH antagonists or aromatase inhibi- 
tors 

5 Examples 

The following are examples of embodiments of the invention and should not be 
regarded as limiting for the present invention. 

10 Example 1 

Culture of small cell lung cancer (SCLC) cell lines: 



The following small cell lung cancer cell lines were used for analysis 



Cell line 


Cell line established 


Growth: 
A= adherent 
S= suspension 


Growth medium 


CPH 54A . 


University of Copenhagen, Denmark 
(Engelholm eta!., 1986) 


A 


MEM (EAGLE) + 10% FCS 


CPH 54B 




MEM (EAGLE) + 10% FCS 


GLC2 


Groningen Lung Cancer Centre, The Netherlands 
(de Leij et al., 1986; Berendsen et al., 1988, 
Butte etal., 1993) 


MS) 


RPMI + 10% FCS 


GLC3 


S(A) 


RPMI + 10% FCS 


GLC14 


S 


RPMI + 10% FCS 


GLC16 


s 


RPMI + 10% FCS 


GLC19 


s 


RPMI + 10% FCS 


GLC 26 


s 


RPMI + 10% FCS 


GLC 28 


s 


RPMI + 10% FCS 


DMS 53 


Dartmouth Medical School, NH, USA 
(Pettengill et al., 1980) 


A 


Waymouth + 10% FCS 


DMS 79 


S 


RPMI + 10% FCS 


DMS 92 


MS) 


Waymouth + 10% FCS 


DMS 114 


A 


Waymouth + 10% FCS 


DMS 153 


A 


Waymouth + 10% FCS 


DMS 273 


A 


Waymouth + 10% FCS 


DMS 406 


A(S) 


Waymouth + 10% FCS 
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DMS 456 




MS) 


Waymouth + 10%FCS 


NCIH69 . 


National Cancer Institute, MD, USA 
(Carney etal., 1985) 


s 


RPMI + 10% FCS 


NCI N417 


s 


RPMI + 10% FCS 


MAR H24 


Philips University, Marburg, Germany 
(Bepleretal., 1987) 


s 


RPMI + 10% FCS 


MAR 86 Ml 


s 


RPMI + 10% FCS 



All cells were maintained at 37°C in 5% C0 2 in a humidified atmosphere in medium 
without antibiotics and passaged twice weekly. All media and serum were obtained 
from Life Technologies. 

5 

Xenografts 

0.5-1.2 x 10 7 cells were inoculated bilaterally, subcutaneously in the flanks of 12-13 
weeks old Balb/c nude mice. The mice were sacrificed and the xenografted tumors 

10 were harvested when one of the tumors had reached a maximal diameter of 1 cm. 
Necrotic tissue was removed. The cell line CPH 1 36A was only propagated in nude 
mice by inoculation of a 2 mm tumor block. Tumors for RNA isolation were either 
processed immediately or stored 24 hours in RNA later (Ambion) followed by stor- 
age at -70°C and processed as described below. Tumors used for lysates for West- 

15 ern blot analyses were processed immediately as described below. 

RNA from normal tissues 

Total RNA from normal, human tissues were obtained from either Clontech (fetal 
20 brain, brain, lung, kidney, heart, trachea, adrenal gland, prostate, salivary gland, 

thyroid) or from Ambion (lung, liver, brain, pancreas, spleen, small intestine, skeletal 
muscle, colon, stomach, testes). Only one sample was analysed in duplicate (lung 
RNA from Clontech and Ambion) and one in triplicate (brain RNA from 2 different 
batches from Clontech and one from Ambion). Fetal brain was included as a refer- 
25 ence for embryonal, neuroendocrine tissue. 

Isolation of RNA from cell lines. 



30 



Cells from semi-confluent cultures were harvested (by trypsinisation for adherent 
cells) and total RNA from approx. 10 7 cells was isolated using RNeasy Kit (Qiagen) 



i 



WO 03/000928 PCT/IB02/03534 

143 

according to manufacturers instructions. Xenografted tumors (fresh or after storage 
in RNA later) were homogenised in TRIzpl (Life Technologies) and RNA purified 
according to the manufacturers instruction. The TRIzol isolated RNA was further 
purified using RNAeasy kit (Qiagen). 
5 The concentration of the RNA was estimated by the absorption at 260nm (A 260 ). The 
integrity of the RNA was verified by measuring the ratio of A 26 o/28o to be 1 .9 or more 
and by estimating the ratio of 28S rRNA to 18S rRNA analysed by formaldehyde 
(denaturing) gel analysis to being approximately 2. 

10 Preparation of cDNA. 

10 ^g total RNA in 10 \i\ H 2 0 was hybridised to 100 pmol T7-(dT) 2 4 primer 
(HPLC purified 5'- GGCCAGTGAATTGTAATACGACTCACTATAGGGAGG- 
CGG(T) 24 (SEQ ID NO 401) obtained from GENSET) after denaturation at 70°C for 

15 10 min. The following reactions were performed using reagents from Gibco BRL, 
Life Technologies. First strand synthesis was performed using 400 U Superscript 
RnaseH' Reverse Transcriptase kit in a 20 \i\ reaction in first strand buffer (50 mM 
Tris-HCI (pH 8.3), 75 mM KCI, 3 mM MgCI 2 ) 10 mM DTT, 0.5 mM dNTPs (each) at 
42°C for 1 hour. The second strand synthesis was performed in a 150 ^l reaction in 

20 second strand buffer (20 mM Tris-CI (pH 6.9), 5 mM MgCI 2 , 100 mM KCI, 0.15 mM 
P-NAD\ 10 mM(NH 4 ) 2 S04 containing 0.26 mM dNTPs, 0.07 E. coli DNA ligase, 
0.27 U/jxl E. coli DNA polymerase, 0.013 \J/\i\ E. coli Rnase H by incubation for 2 
hours at 16°C. DNA ends were filled out by addition of 0.07 U/^l T4 DNA polymer- 
ase and incubation for 5 min at 16°C. The reactions were terminated by addition of 

25 EDTA to 33 \iM final concentration. The cDNA was purified by extraction with 1 vol- 
ume phenol:chloroform:isoamylalchohol (25:24:1) saturated with 10 mM Tris-HCI 
(pH 8.0), 1 mM EDTA followed by precipitation in 2.5 M NH 4 Ac in 63% ethanol with 
addition of 2 \i\ Pellet Paint (Novagen) for visualization of pellet. After 2 consecutive 
rinsing of the pellet with 80% ethanol, the pellet was air dried and dissolved in 12 ^il 

30 water. An aliquot was analysed by agarose gel electrophoresis to ensure the length 
of the cDNA to be in the range of 0.1 -> 10 kb. 



Preparation of biotin labelled cRNA (IVT-cRNA) 
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In vitro transcription generating biotin labelled cRNA (complementary RNA) with T7 
RNA polymerase using biotin labelled ribonucleotides was performed with the BioAr- 
ray™, High Yield™ RNA transcript labelling kit from Enzo Diagnostics, NY, USA) 
using 6 ^l cDNA (estimated to contain approx. 1jig cDNA) in a 40 \i\ reaction ac- 
5 cording to manufacturers specifications. The biotin labelled cRNA was purified using 
RNeasy spin columns kit (Qiagen) according to manufacturers specified method for 
RNA cleanup. An aliquot of the IVT-cRNA was analysed by denaturing agarose gel 
electrophoresis to ensure full length transcripts (.1->10 kb). The concentration of the 
cRNA was estimated by the absorption at 260nm and corrected for contribution of 
10 total RNA initially used for the cDNA reaction. The yield varied from 25-100 \ig per 
reaction. 

Fragmentation of IVT-cRNA 

15 22 jig IVT-cRNA was fragmented by incubation in 0.04 M Tris-Acetate (pH 8.1), 0.03 
M MgAc, 0.1 M KAc in a 20 pi reaction for 35 min at 94°C. An aliquot of the frag : 
mented IVT-cRNA was analysed by agarose gel electrophoresis to ensure frag- 
mentation to the size of 30-200 bases. 

20 Hybridisation to Affvmetrix GeneChip™ and analysis of data (CHIPs analysis) 

A hybridisation mixture containing 20 ng of fragmented IVT-cRNA in a volume of 
400 |jU containing 0.1 M MES, 0.75, [Na*] , 0.1 mg/ml herring sperm DNA, 0.1 mg/ml 
acetylated BSA, 0.05 nM biotinylated control oligo B2 (5'- GTCGTCAA- 

25 GATGCTACCGTTCAGGA (SEQ ID NO 402)) and control biotin labelled IVT-cRNA 
for spiking prepared from the plasmids pglks-bioB (150 pM), pglks-bioC (500 pM), 
pglks-bioD (2.5 nM) and pglks-cre (10 nM) (American Tissue Culture Collection). 
The control oligo and control cRNAs were obtained from Affymetrix. 100 \i\ was hy- 
bridised to an Affymetrix test2 CHIP followed by staining with a streptavidin- 

30 phycoerythrin conjugate and labelling with biotinylated anti-streptavidin goat anti- 
body followed by a final staining with streptavidin-phycoerythrin conjugate (accord- 
ing to the manufacturers protocol Mini-euk1) or 300 )iil was hybridised to an Af- 
fymetrix U95A GeneChip and stained according to the manufacturers protocol Eu- 
kGE-WS2 in an Affymetrix Fluidics station and scanned at 560 hm in a confocal la- 

35 ser scanner (Hewlett Packard GeneArray Scanner G2500A). The digitalized image 
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data was first processed using Affymetrix Microarray Suite™ version 4.0 for evalua- 
tion of the quality of the RNA and hybridisation and Affymetrix Data Mining Tool 
(version 2.0) for selection of candidate genes. The data was re-analysed using Af- 
fymetrix Microarray Suite™ version 5 (see results) for selection of surface molecules, 
5 Data was only used from analyses where: the control oligos BioB, BioC, BioD and 
Cre were all detected as present; the scaled noise (Q) was below 10; the ratio of 
detection of the mRNA levels of the 5' ends relative to the 3' end of glyceraldehyde- 
3-phosphate dehydrogenase (GAPDH) and beta-actin were below 2 ; at least 40% 
of all probe sets were identified as present. For comparison between samples, the 
1 0 global intensity was set at 1 00. 

RT-PCR 

Semi-quantitative RT-PCR was performed on selected genes for validation of the 
15 Chips analysis. cDNA prepared as described above was used for the RT-PCR but 
as an independent preparation than used for Chips analysis. The PCR reaction was 
performed using cDNA from 350 ng total RNA in a 25 jil reaction with 200 nM prim- 
ers (DNA Technology A/S), 1.5 mM MgCI 2 , 0.2 mM each dNTPs, 0.1U/*il Platinum 
Taq Polymerase (Life Technologies) in the buffer provided with the enzyme with 
20 0.008% cresol red and 12% sucrose as loading buffer. 

All reactions were run 94°C, 2 min, 1 cycle; 94°C, 30 sec, annealing temperature as 
indicated for each primer set, 30 sec, 72°C, 30 sec for 25 cycles and a final exten- 
sion step of 72°C 10 min. Using only 25 cycles makes the reaction semi- 
quantitative. 

25 

The primer sets used were: 

Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) 
30 51 2 bp PCR product spanning GenBank Acc. no. NM J)02046 bp. 608-1 119, 
Sense: 5'-TCCATGCCATCACTGCCACCCA (SEQ ID NO 403) 
Antisense: 5-TCTTGTGCTCTTGCTGGGGCTG (SEQ ID NO 404) Annealing temp. 
56°C 

One RT-PCR reaction has been performed. 



WO 03/000928 



PCT/D802/03534 



146 

Pro 30 (KIAA0042) 

432 bp PCR product spanning GenBank Acc. no. D26361 bp. 5181-5612 , 
Sense: 5'- GTTTTGAATCTGAAGAAAGCCC (SEQ ID NO 405) 
Antisense: 5'-TCAAACTCCTGACCTTGTGATCT (SEQ ID NO 406) Annealing 
5 temp. 49°C. 

2 independent RT-PCR reactions have been performed. 

Pro 41 (MAD2) 

525 bp PCR product spanning GenBank Acc. no. AJ00Q186 bp. 643-1 167 , 
10 Sense: 5'- GTAAATAGCATGGTGGCCTACA (SEQ ID NO 407) . 

Antisense: 5'-GGTCCAAAGGAGCTATACAGCA (SEQ ID NO 408) Annealing temp. 
45°C. 

2 independent RT-PCR reactions were performed. 

15 

Pro 221 (insulinoma-associated antigen, lA-1) 

532 bp PCR product spanning GenBank Acc. no. M931 19 bp. 1 549-2080 , 
Sense: 5'- GTGTTCCCCTGCAAGTACTGCCC (SEQ ID NO 409) 
Antisense: 5'-CAGAGATTGGTAGGCGAGGCGA (SEQ ID NO 41 0) Annealing 
20 temp. 52°C 

2 independent RT-PCR reactions were performed. 

Pro 210 (lamin B1) 

25 439 bp PCR product spanning GenBank Acc. no. L37747 bp. 424-862 , 
Sense: 5'-ACTGTGTACTGTTCGGAAGGG (SEQ ID NO 41 1) 
Antisense: 5'-TAGAGAAACCCTTCCCTCCC (SEQ ID NO 412) Annealing temp. 
46°C. 

Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
30 testis 

• Pro 7l(p16INK4/MTS 1, CDKN2A) 
437 bp PCR product spanning GenBank Acc. no. U26727 bp.176-612 . 
35 Sense: 5'- TGAGGAGCCAGCGTCTAGGG (SEQ ID NO 41 3) 
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Antisense: 5'-GTGGCCCTGTAGGACCTTCG (SEQ ID NO 414) Annealing temp. 
57°C 

Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
testis 

5 

DR6 (TNFRS12, tumor necrosis factor receptor superfamiliy member 21) 
559 bp PCR product spanning GenBank Acc. no. AF068868 bp. 1081-1639 , 
Sense: 5 - GTGCTTGTGGTGATTGTGGTGTG (SEQ ID NO 415) 
Antisense: 5'-TGTTCTTGTCCTGTGGGGAAGG (SEQ ID NO 416) Annealing temp. 
10 56°C. 

2 independent RT-PCR reactions were performed. 

NCAM1 (neural cell adhesion molecule) 

456 bp PCR product spanning GenBank Acc. no. HSU63041 bp 2045-2500, 
1 5 Sense: 5'- TATGAGGTCTACGTGGTGGC (SEQ ID NO 41 7) 

Antisense: 5'- CTCCTGGCACTCTGGCTTTG (SEQ ID NO 418) Annealing temp. 53 
°C. 

Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
testis 

20 

NPTXR (Neuronal pentraxin receptor) 

482 bp PCR product spanning GenBank Acc. no. HS327J16 bp.4601 2-46493 , 
Sense: 5'- CACACGCACACATGTTGCAGC (SEQ ID NO 41 9) 
Antisense: 5 - GCTCTGAGAGGCCAAAGCC (SEQ ID NO 420) Annealing temp. 
25 55°C. 

Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
testis , 

GLUR2 (ionotropic glutamate receptor 2; GRIA2) 
30 522 bp PCR product spanning GenBank Acc. no. L20814 bp. 2449-2970 , 
Sense: 5*- AGGAACCCCAGTAAATCTTGCAG (SEQ ID NO 421) 
Antisense: 5 - TCAGTCACACTGACATTCATTCCC (SEQ ID NO 422) Annealing 
temp. 51 °C 

Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
35 testis 
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ITGA V (integrin alpha V subunit) 

533 bp PCR product spanning GenBank Acc. no. M14648 bp. 3867-4399 , 
Sense: 5'- AATTTTAGGTCAAATCCTTCAAGCCAAC (SEQ ID NO 423) 
5 Antisense: 5-TGACAGCCGAGACTGATTTTACACATTA (SEQ ID NO 424) An- 
nealing temp. 50°C. 

Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
testis 

1 0 LRP8 (apolipoprotein E receptor 2) 

459 bp PCR product spanning GenBank Acc. no. HSZ75190 bp. 2016-2474 , 
Sense: 5'- GCTCCATATAGGGAGAACTGCTCAG (SEQ ID NO 425) 
Antisense: 5-CCCCAGCAACCAAACATCTTCT (SEQ ID NO 426) Annealing temp. 
50°C. 

15 Only one RT-PCR reaction has been performed. RT-PCR was not performed on 
testis. 

Western blotting 

20 Protein samples 

Whole cell lysates were extracted from cell lines and xenografted tumors for valida- 
tion of protein expression of selected genes. The lysates were prepared from semi 
confluent cultures of cell lines by scraping with a rubber policeman (for adherent 
cells) and washing in ice-cold 20 mM Tris-CI pH 7.5. The cell pellet was lysed in ice 

25 cold 20 mM Tris-CI pH 7.5, 2% Triton X-1 00 containing Protease Inhibitor Cocktail 
set II and III (Calbiochem) diluted 1:100. After vortexing the lysates were cleared by 
centrifugation at 15.000 x g for 5 min, 4°C. Lysates from xenografted tumors was 
prepared immediately after harvesting of the tumors and a lysate from an adult rat 
brain was processed in a similar manner The tumors were weighed and homoge- 

30 nised using a Heindolph DIAX 900 homogenised in 5 volumes (w/w) ice cold 20 mM 
Tris-CI pH 7.5, 2% Triton X-100 containing Protease Inhibitor Cocktail set II and III 
(Calbiochem) diluted 1:100. cleared by centrifugation at 1 5.000 x g for 5 min, 4°C. 
Protein concentration of the lysates was determined using the BCA Protein Assay 
(Pierce) as recommended by the manufacturer. 
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Commercial cell lysates of Jurkat (Santa Cruz) and A431 (Neomarkers) were used 
as positive controls in some western blots. 

SDS-PAGE and blotting. 1 
5 5-15 fig lysate was loaded per lane in LDS sample buffer with reducing agent (Nu- 
PAGE) and separated on 3-8% Tris Acetate SDS gels, run for 150 V 1 hr in Tris- 
Acetat SDS running buffer (NuPAGE) and transferred to PVDF LC 2002 (Novex) 
membrane in Transfer Buffer (NuPAGE). Protein size marker was ProSieve colour 
protein marker. For probing with anti-NCAM1 antibodies, the lysates were pre- 
10 treated for 5 min at 37°C with 40 ng/[il recombinant EndoN-HIS (gift from E. Bock) 
to remove polysialylation. 

The membranes were blocked in washing buffer (10 mM Tris-CI pH 7.5, 100 mM 
NaCI, 0.1 % Tween 20) containing 5% low fat milk for 60 min at room temperature 
(for antibodies against Integrin ocE (CD103), For ITGAE a Tris-CI buffer pH 10.2 was 
15 used for all incubation and washing procedures. The blots were incubated with pri- 
mary antibodies and secondary antibodies in blocking buffer as described below and 
bound antibodies visualised by ECL (Amersham) or alkaline phosphatase using 
NBT/BCIP tablets (Roche) as recommended by the manufacturers. 

20 NCAM1 (neural cell adhesion molecule) 

Primary antibody: Mouse monoclonal anti-NCAM1 clone 123C3 (Santa Cruz) diluted 
1 : 100 Incubation 16 hours at 4°C. Secondary antibody: Alkaline phosphatase 
conjugated rabbit anti-mouse Ig (DAKO) diluted 1: 500. Incubation 1 hour at room 
temperature. Development by alkaline phosphatase. 

25 

GluR2 (ionotropic glutamate receptor 2) 

Primary antibody: Mouse monoclonal anti GluR2 and 4 (clone3A1 1) (Pharmingen) 
diluted 1:500 Incubation 16 hours at4°C. Secondary antibody: Alkaline phospha- 
tase rabbit anti mouse Ig (DAKO) diluted 1: 500 Incubation 1 hour at room tem- 
30 perature. Development by alkaline phosphatase. 

GRM8 (GluR8 (metabotropic glutamate receptor 8)) 

Primary antibody: Rabbit polyclonal anti-mGluR8 (Upstate Biotechnology, TriChem) 
diluted 1: 500. Incubation 16 hours at4°C. Secondary antibody: Horseradish peroxi- 
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dase swine anti rabbit Ig (DAKO) diluted 1 : 1000 Incubation 1 hour at room tem- 
perature. Development by ECL. 

NPTXR (neuronal pentraxin receptor) 
5 Primary antibody: Goat polyclonal anti NPTXR (C-17)(Santa Cruz) diluted 1 : 500 

Incubation 16 hours at 4°C. Secondary antibody: Horseradish peroxidase rabbit anti 
goat Ig (DAKO) diluted 1: 1000 Incubation 1 hour at room temperature. Develop- 
ment by ECL 

10 ITGAE (integhn alpha E subunit) 

Primary antibody: Goat polyclonal anti Integrin aE (N-19) (Santa Cruz) diluted 
1:1000. Incubation 16 hours at 4°C. 

Secondary antibody: Alkaline phosphatase rabbit anti goat Ig (sc-2771) (Santa Cruz) 
diluted 1 : 500. Incubation 1 hour at room temperature 
1 5 Development by alkaline phosphatase. 

Cluster analysis of obtained Chips analysis data. 

To interpret the variation in expression patterns seen between the normal tissues 
20 analysed and the small cell ling cancer cell lines we took advantage of the proper- 
ties of both SOMs (self-organising maps) and hierarchical clustering. These were 
used consecutively to group genes on the basis of similarity in the pattern of expres- 
sion. Genes used for the analysis were those that had an average difference of 
more than 50 and were scored present in any one of the samples. 

25 

SOMs 

Self-Organising maps (SOMs) is a method of cluster analysis that is somewhat re- 
lated to k-means clustering. The basic principle behind the SOM algorithm is that the 
30 weight vectors of neurons, which are first initialised randomly, come to represent a 
number of original measurement vectors during an iterative data input (Toronen et 
al, 1999). The following parameters were used in the calculations: Genes: Xdim: 1, 
Ydim: 10, Iterations: 100000, Samples. Xdim: 1, Ydim: 10, Iterations: 20000. 

35 Hierarchical Clustering 
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The basic idea behind hierarchical clustering is to assemble a set of items (genes or 
arrays) into a tree, where items are joined by very short branches if they are very 
similar to each other, and by increasingly longer branches as their similarity de- 
creases. The output file from the SOM clustering is used for the hierarchical clus- 
5 tering, meaning that the ordering by the SOM clustering is used to guide the flipping 
of nodes in the hierarchical tree (Eisen et al., 1998). The following parameters were 
used in the calculations: Genes: Cluster: Yes, Calculate weights: Yes, Similarity 
matrix: correlation uncentered, Samples: Cluster: Yes, Calculate weights: Yes, 
Similarity matrix: correlation uncentered. Subsequently an Average Linkage cluster 
10 analysis was performed. 

Results of clustering analysis 

Clustering of the X-axis (samples) (Fig 2) showed as expected that the SCLC cell 
15 lines clustered together, with Mar86MI and CPH54A being furthest apart. CPH54A 
and B clustered very close (B is a clonal variant of A), as did GLC14, GLC16 and 
GLC19 (derived from the same patient). Of the normal tissues expression from lung 
RNA obtained from Ambion and lung RNA obtained from CLONTECH clustered very 
close, as did 2 different batches of brain RNA obtained from CLONTECH with brain 
20 RNA obtained from Ambion. Expression from RNA obtained from fetal brain likewise 
clustered close to the mature brain and was the closest of all normal tissues to the 
SCLC cell lines. This confirms that SCLC lines are of neuroendocrinal origin. 
Clustering of the Y-axis (genes) clearly found 4 very distinct clusters of genes with 
higher expression in the SCLC cell lines. The smallest contained 19 genes, the sec- 
25 ond and third 65 each, and the fourth and largest gene cluster 268. 

Selection criterias for candidate promoters (first nucleic acid sequences). 

The candidate promoters were chosen based on expression level of the gene, which 
30 the promoter controls. The selection was performed on all 21 SCLC cell lines, but 
not xenografts and on 7 normal tissues (brain, adrenal gland, lung, kidney, heart, 
prostate, pancreas). 

Selection was based on several criteria. Only genes, which scored present (P) in the 
absolute call and with an Average difference >50 (level of expression) were in- 
35 eluded. These output data were further processed in Microsoft Excel 2000. Genes 
were selected which were scored present in at least 1 1 of the 21 SCLC lines and if 
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the gene was scored present in one or more normal tissues, the median Average 
difference value of the SCLC cell lines must be 4 times or more above the median 
Average difference value of ( the normal tissue. After a second screening using RNA 
from more normal tissues, the selected candidates are submitted to the same crite- 
5 ria as above and discarded if they do not fulfil the above requirements. 

Validation of Chips analysis by RTtPCR 

Selected genes were analysed by semi-quantitative RT-PCR for verification of ex- 
10 pression identified by Chips analysis. The quality of the cDNA was tested using 
primers for GADPH (Glyceraldehyde-3-phosphate dehydrogenase) (Fig. 2). All 
cDNA samples were very positive showing that the quality of the cDNA for further 
analysis was good. 

RT-PCR with primers for Pro 221 (IA-1, insulinoma associated antigen 1) (Fig. 3) 
15 showed that in normal tissue adrenal gland and brain and fetal brain are weakly 

positive in both Chips and RT-PCR analysis. 4 SCLC lines or xenografts are nega- 
tive in both analyses. All others are weak to very strongly positive. The RT-PCR and 
Chips analysis correlate extremely well. 

RT-PCR with primers for Pro 30 (KIA0042) (Fig. 4) showed that in normal tissue 
20 testes is positive for both Chips and RT-PCR. Other normal tissues are low or nega- 
tive by both analysis methods. All SCLC cells and xenografts are positive in both 
Chips and RT-PCR analysis. There are a few samples where the relative amounts in 
Chips and RT-PCR do not correlate (e.g. high in one and low in the other analysis). 

25 RT-PCR with primers for Pro 41 (MAD2) (Fig. 5) shows low expression in most nor- 
mal tissues and high expression in testes measured both by Chips analysis and RT- 
PCR. All SCLC cell lines and xenografts show very high expression by both Chips 
and RT-PCR analysis. 

RT-PCR with primers for Pro 210 (lamin B1) (Fig. 6) showed very low or no expres- 
30 sion in normal tissues (colon positive for both assays). All SCLC and xenografts 

have high expression by Chips analysis - all except 2 are very positive by RT-PCR. 
RT-PCR values are arbitrarily chosen to match Chips signal. 
RT-PCR with primers for Pro 71 (CDKN2A) (Fig. 7) showed very low or no expres- 
sion in normal tissues and high expression in all but 4 SCLC. Excepto for one sam- 



WO 03/000928 



PCT/IB02/03534 



153 

pie negative in RT-PCR and positive in Chips analysis, the RT-PCR and Chips data 
correlate very well. 

Conclusion on validation of Chips analysis by RT-PCR. 

5 

The Chips data and RT-PCR data correlate extremely well. The low to none expres- 
sion in most normal tissues observed by Chips analysis is confirmed by the semi- 
quantitative RT-PCR reaction. The expression of the selected genes in SCLC cell 
lines and xenografts are very high and in all or most cell lines. Therefore using 
10 Chips analysis for identification of promoters with high and specific expression is an 
applicable method. 

Selection criteria's for candidate cell surface molecules identified by Chips analysis. 

15 The first generation of candidate cell surface molecules were selected on basis of 
several criteria. The selection was performed on all 21 SCLC cell lines, but not 
xenografts and on 7 normal tissues (brain, adrenal gland, lung, kidney, heart, pros- 
tate, pancreas). Only genes, which scored present (P) in the absolute call and with 
an Average difference >50 were included. These output data were further processed 

20 in Microsoft Excel 2000. A gene was set to score one point for each cell line or tis- 
sue. The total scores for each gene were summarised for normal tissue and the 
SCLC cell lines, respectively. Genes were selected which were scored present in at 
least 5 of the 21 SCLC lines. A search was performed among these and candidate 
genes selected if one of the following words is included in the gene name: "receptor, 

25 membrane,, adhesion, integrin, surface, antigen, syndecan, transport, channel, hor- 
mone, binding, glycoprotein, matrix, CAM, desmosome, gap junction, delta, immu- 
noglobulin, MHC, CD, (HSPG, CSPG, integral, notch)". The functions and cellular 
localisations of the proteins were unravelled based on database searches (NCBI: 
Nucleotide, Protein, Nucleotide, OMIM, PubMed, LocusLink). The best candidate 

30 genes were then selected based on these informations with emphasis on the func- 
tion, cellular localisation and scores on expression (i.e. higher "expression score" for 
SCLC than for normal tissue). Furthermore, the expression in the different normal 
tissue is evaluated according to the specific tissues, in order to estimate the theo- 
retical side effects. A second selection was performed on RNA from 21 SCLC cell 

35 lines, 8 of the cell lines grown as xenografts and 17 normal tissues (brain, lung, kid- 
ney, heart, trachea, adrenal gland, prostate, salivary gland, thyroid, liver, pancreas, 
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spleen, small intestine, skeletal muscle, colon, stomach, testes) using the Affymetrix 
Microarray Suite™ version 5. Only expressed genes, which scored present (P) in the 
absolute call and with a signal >20 in at least 6 SCLC cell lines or xenografts were 
included. Further selection was performed as described above. 

5 

Validation of Chips analysis by RT-PCR 

Selected genes were analysed by semi-quantitative RT-PCR for verification of ex- 
pression identified by Chips analysis. RT-PCR with primers for DR6 (TNFR related 

10 death receptor 6) (Fig. 8) shows medium expression in most normal tissues and 
medium to high in all except one SCLC line or xenograft. Chips analysis shows 
high expression in 2 normal tissues and high expression in 8 SCLC lines or xeno- 
grafts. All positive by Chips analysis are also positive in RT-PCR. 
RT-PCR with primers for LRP8 (Apolipoprotein E receptor 2) (Fig. 9) shows low ex- 

15 pression in 6 normal tissues and high expression in all SCLC lines and xenografts 
are positive by RT-PCR. All positives in Chips analysis are also positive in RT-PCR. 
RT-PCR values are arbitrarily chosen to match Chips signal. 
RT-PCR with primers for NTPXR (neuronal pentraxin receptor) (Fig. 10) showed 
that all positive by Chips analysis are also positive by RT-PCR. There are more tis- 

20 sues and SCLC with positive expression as measured by RT-PCR, but expression is 
averagely higher in SCLC! Two SCLC samples are negative. RT-PCR values are 
arbitrarily chosen to match Chips signal. 

RT-PCR with primers for NCAM1 (neural cell adhesion molecule) (Fig. 1 1) showed 
all samples positive by Chips analysis were also positive by RT-PCR. Several tis- 
25 sues and all except one SCLC are positive by RT-PCR only. One SCLC cell line is 
negative in both RT-PCR and Chips analysis. RT-PCR values are arbitrarily chosen 
to match Chips signal. 

RT-PCR with primers for GluR2 (ionotropic glutamate receptor 2)(Fig. 12A) showed 
all samples positive by Chips analysis were also positive by RT-PCR. Both analysis 
30 showed very high expression in brain and RT-PCR low expression in adrenal gland. 
4 SCLC cell lines are negative in both RT-PCR and Chips analysis. RT-PCR values 
are arbitrarily chosen to match Chips signal. 

RT-PCR with primers for ITGAV (integrin alpha v subunit) (Fig. 12B). 5 samples are 
positive in Chips analysis, but negative by RT-PCR. Otherwise good correlation 
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between Chips analysis and RT-PCR analysis. High expression in SCLC, but also in 
many tissues. 

RT-PCR values are arbitrarily chosen to match Chips signal. 

5 Conclusion on validation of Chips analysis of expression of surface molecules by 
RT-PCR. 

Except for ITGAV all genes identified as expressed by Chips analysis were also 
found expressed when analysed by RT-PCR. More samples were positive when 
10 measured by RT-PCR. The expression of the selected genes in SCLC cell lines and 
xenografts are high and in many cell lines. Therefore using Chips analysis for identi- 
fication of mRNA for surface molecules expressed by SCLC is an applicable 
method. 

15 Validation of Chips analysis bv western blotting 

Expression of selected gene products was analysed by western blotting using spe- 
cific antibodies for comparison to of expression of mRNA identified by Chips analy- 
sis. Western blot analysis was only performed on SCLC cell lines and xenografts. 

20 Western blot analysis using antibodies to mGluR8 (metabotropic glutamate receptor 
8) (Fig. 13) showed expression of mGluR8 protein in all SCLC cell lines and xeno- 
grafts, whereas Chips analysis only detected expression in 8 samples. The intensi- 
ties of the western blot do not correlate to the Chips values, but clearly show ex- 
pression of mGluR8. Rat brain homogenate was used as positive control. 

25 Western blot analysis using antibodies to NPTXR (neuronal pentraxin receptor) (Fig. 
14) showed protein expression in all SCLC samples identified as having expression 
by Chips analysis. All samples are weak to strongly positive except GLC 28. DMS 
153 has a prominent high molecular weight band also present in rat brain, which 
may be unprocessed or dimerised receptor. The protein amounts do not directly 

30 correlate with the Chips data, but clearly show expression in most SCLC. Rat brain 
homogenate was used as positive control. 

Western blot analysis using antibodies to NCAM1 (neural cell adhesion molecule) 
(Fig. 15) showed expression of two isoforms of NCAM1 by all SCLC cell lines and 
xenografts except one, whereas Chips analysis identified expression in 14 samples. 
35 All samples positive by Chips analysis are positive by western blotting. 
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There is no obvious correlation between relative amounts in Chips analysis and 
western blotting. 1 

Western blot analysis using antibodies to GluR2 (ionotropic glutamate receptor 2) 
(Fig. 16) showed expression in 9 samples. 6 samples were positive by Chips analy- 
5 sis, but negative by western blotting. However, the sensitivity of the antibody was 
not high.The other positive samples correlate well with the Chips analysis. 
Western blot analysis using antibodies to ITGAE (integrin alpha E subunit) (Fig. 17) 
showed expression in most SCLC samples. One sample was positive in Chips 
analysis and negative by Westren blotting. The relative intensities of expression 
1 0 between Chips analysis and western blotting do not correlate for many samples. 
A431 cell lysate was used as positive control. 

Conclusion of western blot validation of Chips analysis on surface molecules 
1 5 For the selected surface molecules all genes identified as expressed by Chips 

analysis are also identified as expressed by western blotting showing that gene ex- 
pression measured by Chips analysis is reflected in protein synthesis. For several 
genes western blotting identified expression in more samples than Chips analysis. 
Therefore the Chips analysis is an applicable method to identify surface molecules 
20 expressed by SCLC. 

Example 2 

Surface molecules expressed by SCLC cell lines identified by RT-PCR 

25 

Other expressed cell surface molecules were identified by the method of RT-PCR. 
mRNA was prepared from all 21 of the above listed cell lines using Quick- 
Prep®mRNA Purification Kit (Pharmacia) according to manufacturers specifications, 
mRNA or total RNA from 29 different tissues was obtained from CLONTECH. The 
30 RNA was obtained from the following tissues: whole brain, spinal cord, small intes- 
tine, kidney, heart, lung, testis, retina, bladder, stomach, uterus, liver, spleen, leuko- 
cyte, adipocyte, pituitary gland, ovary, mammary gland, prostate, trachea, thymus, 
adrenal gland, colon, pancreas, salivary gland, bone marrow, thyroid, lymph node 
and skeletal muscle. 
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Single-stranded cDNA synthesis was performed using the 1 st strand cDNA synthesis 
Kit for RT-PCR (Boehringer Mannheim) according to manufacturers instructions 
using an oligo-(dT) 15 primer. 

Subsequent PCR with the cDNAs as template was performed in 10 mM Tris-CI (pH 
5 8.3), 50 mM KCI, 1 mM MgCI 2 , 0.8 mM dNTPs, 0.4 \iM primers and 0.12 U/^l Ther- 
moprime plus DNA polymerase (Advanced Biotechnologies) with amplifications of 
35 or 40 cycles of 95°C for 30 sec, 62°C for 30 sec and 72°C for 1 min. A control 
reaction using GADPH primers was performed on all cDNAs. The PCR products 
were analysed by agarose gel electrophoresis. Listed below are the gene products 
10 analysed, the sequence of primers with their position in the nucleotide sequence in 
the GenBank database and the percent of cell lines or tissues, which were positive 
for mRNA from the corresponding gene. 





Acc. No. 


I III! 9Cl|Ut7lll#t; 

i 


"Uoiiiun 


% RT-PCR positive 


Normal 


SLCL 
lines 


Atrial na- 
triuretic 
peptide 
clearance 
receptor 


AF025998 


5'- 

AGCGGAACTGCTACTTC 
ACC (SEQ ID 427) 


629-648 


95.5 % 


95.5 % 






5"- 

TAGTCTCCACTGGTCAT 
GCC (SEQ ID 428) 


851-832 






Gas- 

trin/CCK-B 
receptor 


XM_006034 


5-GTGCGA- 
ATGTTGCTGGTGATCG 
(SEQ ID 429) 


994-1015 


100%* 


100 % 






5-ACGGTGCATGAAGCAG 
TAGACC (SEQ ID 430) 


1185-1164 


Neu- 
romedin B 
receptor 


M73482 


5AGATGGAAACACGGAA 
ACGCCTGG (SEQ ID 431) 


909-932 


96.5 % 


95.2 % 






5'-GGCTGTTGAA- 


1151-1128 
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GGCTGTTGAA- 
ATGCCTCCTGAAGC 
(SEQ ID 432) 








Glial cell 
line derived 
neuro- 
trophic 
factor a 
receptor 


NM_001496 


5'- 

TCTGCTTCTCCGACCCG 
CTT (SEQ ID 433) 


761-780 


96.5 % 


95.2 % 






5-TAGCTGCA- 
ATGGCCTCCGTG (SEQ 
ID434) 


1042-1023 


Bombesin 

receptor 

(GRPR) 


XM.010317 


5'- 

CATGCTCCACTTTGTCA 
CCAGC (SEQ ID 435) 


1289-1310 


100%* 


100% 






5'- 

GAGGTCATGCAGGTTGT 
ACTCC (SEQ ID 436) 


1477-1456 


Metabo- 
tropic glu- 
tamate 
receptor 8 




5'CCAGAGCTAAGTGATA 
ACACCAGG (SEQ ID 437) 




O-i A 0/ 
CA . 1 70 


AC O 0/ 






5'- 

TTCACGTGGGAI 1 1 ICT 
GTGACTG (SEQ ID 438) 


.801-825 



analysed on RNA from 7 normal tissues 



The data from the RT-PCR experiments clearly suggest, that the metabotropic glu- 
tamate receptor 8 is a candidate receptor, as it is expressed in 95.2% of the SCLC 
5 cell lines, but only in 21 .1 % of normal tissues. Other receptors are also candidates, 
as they are expressed in more than 95% of the cell lines. A quantification of the 
relative levels of the RNA expression by real-time RT-PCR or northern blotting will 
further identify the suitable receptors. 
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Example 3 

Surface molecules expressed by SCLC cell lines identified by Western blotting 

5 

18 of 19 tested SCLC lines from the same panel as above were found to express 
the surface molecules: the neural cell adhesion molecule (NCAM1) and cadherin 
(Rygaard et al., 1992). The expression in the SCLC ceil lines was determined by 
western blotting utilizing polyclonal antibodies on protein extracts from the cell lines 

10 propagated both in vitro and as xenografts in nude mice. NCAM1 was detected by 
immunohistochemical methods in 20 of 20 surgically ressected SLCL tumours dem- 
onstrating that SCLC cells express NCAM1 in vivo (Kibbelaar, et al., 1991). 
NCAMlis widely expressed during embryonic development, but is highly down 
regulated in the adult (reviewed in Gegelashvili and Bock, 1996), and therefore ex- 

15 pressed at low levels in normal tissues from SCLC patients. It has already been 

demonstrated that NCAM1 expression is in part regulated by endocytosis (Minana et 
al. p 2001) and that NCAM1 can be induced to internalise by antibody binding 
(Michalides et al., 1994). Cadherins has also been found to be endocytosed under 
normal (Kamei et al., 1999; Le et al., 1999) and pathological conditions (reviewed in 

20 Parkes and Hart, 2000). Therefore, these molecules are potential candidates for 
surface receptors for gene transfer. 



Example 4 

25 Surface molecules expressed by SCLC cell lines identified bv other methods 

Several other cell surface, receptors have been shown to be expressed by many 
SCLC cell lines and therefore are also potential candidates for surface receptors for 
gene transfer. 

30 

The expression of high affinity transforming growth factor-beta receptors (TGF-|3 R) 
was demonstrated in several of the cell lines from the above panel by chemical 
crosslinking (Damstrup et al., 1993). By Northern blot analysis, the presence of the 
mRNA for TGF-p Rl was found in 9 of 9 SCLC, TGF-p Rll in 6 of 9 SCLC lines and 
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of TGF-p Rill (betaglycan) in 9 of 9 SCLC lines (Norgaard et al., 1996). Binding of 
thejigand to these receptors induce internalisation of the receptor (Anders et al., 
1997, DoreetaL, 2001). 

5 The presence of insulin-like growth factor receptors (IGF-R) mRNA has been deter- 
mined by RT-PCR and was found present in 14 of 14 examined SCLC lines (Quinn 
et al., 1996). The presence of both IGF-R1 (Rotsch et al., 1992) and IGF-RII 
(Schardt et al., 1993) in 1 1 of 1 1 SCLC lines was demonstrated by Northern blotting, 
competitive binding assays and chemical crosslinking. Both receptors are known to 
10 internalise after ligand binding (Dore et al., 1997) 

The epidermal growth factor receptor (EGF-R) and various homologues, variations 
or mutants (v-enb-B, HER2/neu (c-erb-2), ErbB3 and ErbB4 and EGF-R vlll) have 
been found expressed on a large number of cancer cell lines and tumours and sev- 

1 5 eral forms, internalise after ligand binding (reviewed in Wells, 19990; Huang and 

Harari, 1999). By Northern blot analysis 1 1 of the 21 SCLC lines in the above panel 
were found to express EGF-R. The expression was verified by radioreceptor and 
affinity labelling analysis in 10 of the cell lines (Damstrup et al., 1992). Indeed, the 
EGF-R has been demonstrated to mediate targeted gene delivery in several of 

20 these SCLC lines (Cristano and Roth, 1996, Frederiksen et al., 2000). 



Example 5 

25 Comparisons of gene expression of SCLC cell lines with gene expression of addi- 
tional types of normal human tissues and other tumour cell lines. 

In order to further compare gene expression profiles between SCLC cell lines and 
normal tissues, total RNA from normal tissues from leukocyte will be obtained from 
30 commercial sources (CLONTECH, Stratagene, Ambion or ResGen). Biotin labelled 
cRNA will be prepared as described above. 

It is of importance to determine if the genes highly expressed in SCLC cell com- 
pared to normal tissues is a phenomena general for cancer cells or is SCLC spe- 
cific. Therefore total RNA will be isolated from cell lines from other types of human 
35 cancers (e.g. commercially available cell lines derived from breast carcinoma, gli- 
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oma, non small cell lung cancer (NCLC), colon carcinoma, neuroblastoma) and 
analysed as described above. 

Comparison of gene expression of SCLC cell lines in vitro with gene expression 
5 from SCLC cells in vivo. 

Analysis of gene expression of SCLC lines propagated in vivo 

Selected cell lines are propagated in vivo as xenografts in both flanks of BALB/c 
nude mice according to Rygaard et al„ 1992. When one of the tumours has reached 

10 the size of approximately 1 cm x 1 cm the mice will be sacrificed and tumours re- 
moved. For total RNA isolation from the tumour, the tumour will be stored for 24-48 
hours in RNA/ater™(Ambion) and subsequently removed from the storage solution 
and stored at -70°C until RNA preparation. Total RNA will be prepared from the tu- 
mours by extraction with Trizol (Life Technologies) according to manufacturers 

15 specifications. The total RNA will be further purified on RNeasy columns (Qiagen) 
according to the manufacturers method for RNA cleanup. Analysis of isolated total 
RNA and preparation of cDNA and biotin labelled cRNA and analysis of gene ex- 
pression by Affymetrix Chips will be performed as described above. Protein extracts 
for Western blot analysis are prepared from freshly removed tumours by homogeni- 

20 sation on ice with a teflon pestel in 5 volumes (w/v) of 20 mM Tris-CI (pH 7.5), 2 % 
Triton X-100 with addition of protease and phophatase inhibitors (Protease Inhibitor 
Cocktail Set III and Phosphatase Inhibitor Cocktail Set II from Calbiochem) and 
subsequent clearing by high speed centrifugation (13.000 x g). 

25 Analysis of gene expression of biopsies from patients with small cell lung 
cancer 

Biopsies from patients with diagnosed small cell lung cancer (obtained from Herlev 
Hospital) will be stored for 24-72 hours in RNA/ater™(Ambion) and subsequently 
removed from the storage solution and stored at -70°C. The tumours will be micro 
30 dissected by an experienced pathologist and RNA isolated from the tumours as 

above. RNA from several tumours will be pooled. Should the total RNA amount ob- 
tained not be sufficient for direct preparation of biotin labelled cDNA, the labelling 
procedure will be modified to include 2 further amplification steps as described in 
Ohyama et al., 2000. 

35 
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Example 6 

Experimental procedures for identification of cell surface molecules 

5 Candidate cell surface molecules (receptors) expressed by SCLC cells are identified 
by Gene Chip analysis, Northern blotting, RT-PCR or by Western blotting. The spe- 
cific splice form(s) expressed by the SCLC cells will be determined by RT-PCR 
and/or by sequencing (performed at GATC Biotech AG, Germany). The protein ex- 
pression and subcellular localization of molecules, which are identified only on 

10 mRNA level, must be verified by other methods. If commercially antibodies are 
available, identification by western blotting (using protein extracts prepared from 
SCLC cell lines from the above panel propagated in vitro and in vivo as described 
above) and immunostaining of SCLC cell lines will be performed using the manu- 
facturers recommendations. 

15 For molecules with known ligands, which are commercially available or can be pro- 
duced recombinants (see below), this can additionally or alternatively be accom- 
plished by binding or crosslinking studies. The labelled ligands (e.g. radio-, biotin- or 
fluorescent labelled ligands) will also be used to determine the affinity of the recep- 
tor, number of receptor molecules per cell and their ability for internalisation of the 

20 ligand. 

For cell surface molecules without known ligands, both the expression of the surface 
molecule and identification of ligands must be determined. As the mRNA encoding 
the cell surface molecule is readily available from the SCLC lines, the cDNA encod- 

25 ing the extracellular part can be cloned by standard RT-PCR methods into an ex- 
pression vector to allow expression of a recombinant protein to be used for immuni- 
zation. Preferably expression in a bacterial system (e.g. Qiagen pQE vectors) as a 
fusion with a suitable tag (e.g. 6 x HIS) for easy purification of the recombinant pro- 
tein will be used. Immunization for generation of polyclonal antibodies in rabbits will 

30 be performed at the Department of Experimental Medicine, The Panum Institute, 
University of Copenhagen. Generation of mouse monoclonal hybridomas will be 
performed at the Serum Institute, Copenhagen. Sera from irtimunized animals and 
conditioned medium from hybridomas will be screened for antigen binding using the 
recombinants produced protein as immobilized antigen (in microtiter wells or on 

35 membranes). In addition, the specificity of the antibodies on the surface molecule, 



t 
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when expressed by mammalian cells, must be performed. This will be achieved by 
cloning the cDNA encoding the full length molecule into an eukaryotic expression 
vector (e.g. pcDNA 3.1 from Invitrogen or pCMV-Tag from CLONETCH) using RT- 
PCR. After transient transfection of a cell line, which does not endogenously ex- 
5 press the molecule, the specificity of the antibodies will be determined using indirect 
immunofluorescence staining. 

When a suitable antibody or serum has been identified, the protein expression will 
be analysed by immunostaining on the SCLC cell lines grown in vitro and in vivo and 
additionally on SCLC biopsies to verify of expression both in vitro and in vivo. The 
10 expression in normal tissues will be evaluated using a human tissue array contahr 
ing 200 distinct tissue samples spotted on glass microscope slides (VastArray™ 
from GenRes). 

Alternatively, human single chain antibodies isolated from a phage display library 
can be utilized (see below). 

15 

Example 7 

Experimental procedures for identification of ligands to a cell surface molecule and 
determination of their capacity for internalisation 
20 ~ ~ ~ ' 

Known ligands, which are commercially available, will, when possible, be obtained in 
either a radio-, biotin- or fluorescent labelled form. For analyses of integrins as can- 
didate surface molecules, the specific integrin alpha and beta subunit combination 
found in the cell lines must first be determined to identify the extracellular matrix 
25 ligand. This can be performed by immunostaining, as many antibodies against spe- 
cific integrin combinations are commercially available. 

If the ligand is commercially available, but not in a labelled form, the ligand can be 
labelled with 125 l (e.g. using the chloramine-T method) or with a fluorescent dye or 
biotin (e.g. using FluoReporter Kits from Molecular Probes). Binding assays will be 
30 performed to determine the specificity and capacity of ligand binding to the surface 
molecule. Using the labelled ligand, the ability of the surface molecule to internalise 
at 37°C (with incubation at (i-4 0 C as control) can be monitored after stripping of ex- 
ternally bound ligand (e.g. by acid or protease treatment) and measurement of inter- 
nalised radioactivity for radio-labelled ligand; staining with enzyme or fluorescent 
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labelled streptavidine for biotin labelled ligand or direct evaluation for fluorescent 
labelled ligand by microscopy. 

If the ligand is known, but not commercially available, the gene encoding the ligand 
will be cloned into an expression vector using RT-PCR or obtaining a cDNA library 
5 from a suitable tissue or cell line or (when available) obtain the clone from commer- 
cial sources (GeneStorm® clones from Invitrogen or GeneConnection™ from 
CLONTECH). A suitable tag (e.g. 6 x HIS) should be included in the recombinant 
ligand for easy purification. A bacterial expression system will be preferred. Recom- 
binant expression will also enable the possibility to express the ligand as a fusion 

10 with EGFP for facilitating the analysis of binding and internalisation. Alternatively, 
antibodies against the tag can be used for analysis of binding and internalisation. 
However, should posttranslational modifications such as glycosylation or sulfatation 
be essential for binding of the ligand to its receptor, expression as a secreted protein 
can be achieved in a yeast system (Pichia pastorius), in a insect system (Baculoyi- 

15 rus) or in mammalian cells (e.g. HEK293, COS-7 or CHO cells). 

If the ligand of a cell surface molecule is unknown, homology studies based on the 
genomic sequence or amino acid sequence of the receptor may result in identifica- 
tion of a superfamily of receptors to which the particular receptor belongs. A panel of 
ligands specific for this superfamily can then be tested using the methods described 

20 above. Alternatively, screening with a bacterial peptide expression library (e.g. FliTrx 
Random Peptide Display Library from Invitrogen) may identify of one or more pep- 
tide ligands. These peptide ligands can subsequently either be cloned for recombi- 
nant expression or obtained commercially. For this screening it would be optimal to 
use a cell line, which does not express the candidate surface molecule as screening 

25 for non-specific binding and the same cell line transfected with an expression plas- 
mid for the surface molecule for identification of specific peptide ligands. 
If mouse monoclonal antibodies towards the cell surface molecule have been gen- 
erated, an alternative is to screen these antibodies for the capacity of internalising 
by detection of endocytosed antibodies by fluorescent labelled anti-mouse antibod- 

30 ies. Recombinantly expressed single chain antibodies cloned from the antibody pro- 
ducing hydbridoma will also be tested. For clinical trials, these antibodies must be 
humanized for example by the method described in Losman et al., 1999. If no inter- 
nalising monoclonal antibodies are available, a phage library expressing human 
single chain antibody fragments cgn be used for isolation of internalising antibodies. 

35 By removing unspecrfic binding of phage displayed antibodies by incubation with a 
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cell line negative for the cell surface molecule in question and selection with a 
transfected cell line expressing the molecule (as described above) specific and in- 
ternalising antibodies can be identified and subsequently cloned from the phagemid 
DNA taken up by the cell after endocytosis(Nielsen and Marks, 2000; Heitner et al., 
5 2001)(collaboration with Prof. J, Engberg, Royal Danish School of Pharmacy). 

Example 8 



Identification of a promoter for expression of a therapeutic gene. 
10 , 

The promoter region from genes, whose expression by GeneChips analysis has 
been found to be high in SCLC cell lines and xenografts and low or negative in nor- 
mal tissues, are potential candidates to control and mediate expression of a thera- 
peutic gene in targeted gene therapy. The expression by candidate promoters de- 

15 termined by GeneChips analysis will first be verified by RT-PCR or Northern blotting 
using several different primer sets or probes covering the entire molecule on the 
same RNAs used for GeneChips analysis (from SCLC cells and normal tissues) to 
ensure the cancer cell specificity of the promoter (as alternatively spliced variants 
expressed by the same promoter in normal tissues may not be recognized by the 

20 Affymetrix Chip). As the activity and specificity of a promoter can be encoded in a 
very large portion of DNA> it is essential to define the region(s) of the promoter, 
which are sufficient for specific and high expression in SCLC cells in order to limit 
the size of the DNA encoding the therapeutic gene to enhance delivery by a surface 
molecule. We set this limit to 1 5 kb, which is within the feasible size for cloning by 

25 PCR. Initially, a region of approx 1 5 kb upstream from the coding region of the can- 
didate gene, including the region coding for the 5' untranslated part of the mRNA, 
will be cloned by PCR using a thermostabile polymerase, which is capable of ex- 
tending large PCR products with genomic DNA as template (e.g. Herculase from 
Stratagene). The primers used for PCR will be designed from the genomic se- 

30 quence in the HUGO database and will be designed to contain either rare restriction 
sites for cloning by restriction cleavage or to contain loxP sites for direct cloning 
without restriction cleavage by addition of Cre recombinase. The vector to be used 
for testing the promoter regions will be constructed to contain a promoterless gene 
encoding the Enhanced Green Fluorescent Protein (EGFP) from CLONTECH pre- 

35 ceded by rare restriction sites in the multiple cloning sites (e.g. pd2EGFP-1 from 
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CLONTECH) and/or a loxP site. The activity of the promoter will be estimated visu- 
ally in a semi quantitative manner after transfection into the SCLC lines (e.g. using 
Lipfectamine Plus™ from Life Technologies) using fluorescence microscopy or 
quantitatively using a fluorometer (e.g. Victor 1420 from Wallac). As control for 
5 transfection efficiency, a low amount of plasmid encoding a red fluorescent protein 
under the control of a CMV promoter (pDsRed2-N1 from CLONTECH) will be used. 

Promoters, which are active in the above assay, will be subcloned into smaller frag- 
ments (by PCR as described above or by standard restriction enzyme digestion) and 

10 tested for promoter activity as above. The relative activities of the promoters and 

subclones thereof can be determined quantitatively by recloning into a promoterless 
vector encoding a firefly luciferase and as transfection control, co-transfection with a 
plasmid encoding a renilla luciferase expressed from a SV40 promoter. Using the 
Dual-Luciferase® Reporter Assay System from Promega, the transcription from 

15 both plasmids in an extract of transiently transfected cells will be quantified using a 
luminometer (Lumat LB9507 from EG&G). In a similar manner, chimerics of the ac- 
tive parts of different, strong SCLC specific prompters can be tested for optimal ex- 
pression and regulation. Alternatively, addition of enhancer sequences from other 
genes (e.g. viral enhancers) can be inserted. To ensure that the specificity of the 

20 selected promoter regions for SCLC cells compared to normal tissues is not lost in 
the various constructions, these will additionally be tested by transfection into com- 
mercially available cell lines of various origin derived from normal tissues. If higher 
specificity is needed, an additional specificity for cancer cells with mutations in p53 
gene will be incorporated in the system. By inserting loxP sites adjacent to the pro- 

25 moter for the therapeutic gene and inserting the gene encoding Cre recombinase 
under the control of a p53 activated promoter, normal cells expressing wt p53 will 
express Cre recombinase which excises the promoter for the therapeutic gene, 
which therefore is not expressed. 

30 If the transcriptional activity of the tumour specific promoter is not sufficient to 

achieve high enough levels of transcript encoding the therapeutic gene, it will be 
possible to utilize the specific promoter for activation of a second tissue-unspecific, 
but highly active promoter e.g. CMV. An example of this system is the encoding of 
Cre recombinase by the specific promoter, which after expression in the tumour tis- 
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sue activates a CMV promoter by recombinational removal of a silencing element 
flanked by loxP sequences (Kijama et al. v 1999). 

In addition, the presence of endogenous transcriptional enhancers (e.g. steroid 
5 hormone receptor binding regions and receptors) will be determined. This will be 
analysed by transfection with the promoter controlling expression of E6FP or lucif- 
erase as described above, after addition of steroid hormones (e.g. retinoic acid, es- 
trogen, progesteron or glucocorticoids). If present, these will give the opportunity to 
enhance the expression of the therapeutic gene by adjuvant administration of the 
10 hormone. Alternatively, these sequences can be inserted into the promoter for en- 
hancement of transcriptional activity if the corresponding receptor is expressed by 
the SCLC cells. 

Example 9 

Optimisation methods for complexinq DNA with a liqand 

A complex formation between the DNA encoding the tissue specific promoter con- 
trolling expression of a therapeutic gene and the ligand must be achieved for spe- 

20 cific internalisation. Several different possibilities will be tested. Biotin labelled ligand 
bound via streptavidine to biotin labelled poly-cationic poly-L-lysine (PLL) will com- 
plex with negatively charged DNA, thus forming a compacted ligand/DNA polyplex, 
which can be internalised via the ligand (Frederiksen et al., 2000). Biotinylation of 
ligand and poly-L-lysin of different sizes can be performed as described by Cristiano 

25 et al., 1996 or Wagner et al., 1990. 

Alternatively, the commercially available branched cationic polymer polyethylen- 
imine (PEI) can be used for forming the ligand/DNA complex. PEI/DNA complexes 
in themselves have a low activity of gene transfer. However, the activity and speci- 
30 ficity can be substantially increased by covalent crosslinking of a ligand to PEI 

(Kircheis et al., 1997). Another possibility will be to test biotin labelled PEI combined 
with biotin labelled ligand and streptavidine, as described for PLL above. 

A further advantage of this system over using PLL is that inclusion of an endosomal 
35 lysis agent in the complex is unnecessary (see below). 



! 
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If the ligand is produced recombinant^, a different approach will also be tested. By 
including peptide sequences in the recombinant ligand, which can bind strongly to 
specific DNA sequences encoded in the DNA containing a therapeutic gene, it is 
5 possible to achieve a DNA/ligand complex, which then can be neutralized and com- 
pacted by PLL. The DNA binding domain from the yeast transcriptional activator 
GAL4 produced as a recombinant fusion with the ligand will be tested in this manner 
using DNA, where tandem repeats of the GAL4 recognition sequences have been 
incorporated into the DNA. 

10 

The above described complexes will initially be tested using DNA encoding EGFP 
controlled by a CMV promoter with a ligand known to bind a cell surface receptor 
capable of internalisation. The efficacy and specificity of will be determined by visual 
evaluation by fluorescence microscopy and/or by fluorometric quantification after 
15 administering to cells with and without expression of the receptor for the ligand. 

Example 10 

Optimisation of endosomal lysis of complex. 

20 

To avoid lysosomal degradation of the endocytosed complex, it is essential to in- 
clude a endosomal lysis agent in the complex for release of the DNA into the cyto- 
plasm or an agent such as Chloroquine, which raises the endosomal pH and 
thereby inhibits degradation by lysosomal enzymes (reviewed in Guy et al., 1995). 

25 Replication deficient adenovirus has been demonstrated as a potent endosmolytic 
agent, when directly coupled to the ligand/DNA polyplex (Yoshimura et al., 1993). 
However, the drawbacks of using deficient adenovirus or viral capsides is unwanted 
immunological response, unspecific uptake of the complex via viral receptors, safety 
precautions and difficulty in preparation and stability. Therefore, to avoid these dis- 

30 advantages and in order to reduce the size of the complex, smaller, preferably non 
viral endosomolytic agents will be tested. The influenza virus hemagglutinin HA-2 N- 
terminal fusogenic peptides (Wagner et al., 1992) , N-terminal rhino virus peptides, 
the pseudomonas exotoxin A translocation domain (Fominaya and Wels, 1996) and 
synthetic peptides (Gottschalk et al., 1996) have been found to mediate endosomal 

35 lysis or endosomal escape. Biotin labelled endosomolytic peptides can be included 
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in the ligand/DNA complex, when generated by biotin labelled poly-L-lysine (PLL) 
coupled to streptavidin. Alternatively, when the ligand is produced recombinant^, 
the peptide sequences can be included in the N- or C-terminal part of the ligand. 
The efficiency of these peptides (added either separately or incorporated into a re- 
5 combinant ligand) will be tested using DNA encoding a reporter gene (EGFP or lu- 
ciferase) controlled by a CMV promoter complexed to a ligand known to internalise 
and the endosomal lysis monitored by evaluation of expression of the reporter gene. 
If the ligand/DNA complex is assembled by PEI, this agent can alone mediate en- 
dosomal swelling and subsequent lysis and release of the complex (Boussif et al., 
10 1995). 

Example 11 

Optimisation of methods for protection and nuclear targeting of the therapeutic gene. 

15 

To enhance the transport of endosomally released DNA encoding the therapeutic 
gene to the nucleus, the DNA will be covalently linked to a peptide encoding a nu- 
clear targeting sequence (NLS - nuclear localization sequence). By excision of the 
therapeutic gene together with the promoter with restriction enzymes, protection of 

20 the DNA ends from digestion by exonucleases can be achieved by hybridisation to 
and ligation of oligonucleotides, which generate a protective stem-loop cap at the 
double stranded DNA ends. By including an amino-modified nucleotide in the oligo- 
nucleotide, this residue can be used for covalent crosslinking to a C-terminal ami- 
dated peptide encoding a nuclear localization signal (Zanta et al., 1999) (the pep- 

25 tides can be commercially obtained from e.g. Genosys, TX, USA). A number of po- 
tential sequences are mentioned herein above. Initially, the enhancement of expres- 
sion by coupling of a NLS peptide of simian virus 40 large tumour antigen to the 
DNA will be tested using a DNA fragment encoding EGFP with a CMV promoter and 
expression analysed by transient transfection of SCLC cell lines. Other peptides 

30 encoding NLS from other proteins (see herein above) will be tested for determina- 
tion of the most efficient nuclear transport. 



WO 03/000928 



PCT/IB02/03534 



170 

Example 12 

Experimental procedures for selection of therapeutic gene. 

5 

Potential therapeutic genes will be selected from the group of: apoptosis inducing 
gene products, toxic gene products, gene products which introduce sensitivity to- 
wards harmless drugs, antisense RNA for oncogenes, Ribozymes targeted against 
oncogenes or genes encoding antibodies against oncogenes. The cDNA encoding 

10 the gene products for expression of protein or antisense RNA will either be obtained 
by cloning via RT-PCR, PCR on a cDNA library or obtained from commercial 
sources. To evaluate the efficacy of therapeutic genes for promoting cell death, 
these will inserted into a vector under the control of a CMV promoter and the effect 
of expression tested after transient transfection (e.g. using LipofectaminePlus, Life 

15 Technologies) into SCLC cell lines from the above panel, using a plasmid express- 
ing EGFP for co-transfection for identification of transfected cells. For apoptosis in- 
ducing genes the effect of expression on transfected cells will be monitored by spe- 
cific staining (e.g. by use of Vybrant Apoptosis Assay Kit from Molecular Probes). In 
addition, cell death of transfected cells will be monitored by the use of fluorescent 

20 "live stains" (e.g. LIVE/DEAD Viability/Cytotoxicity Kit from Molecular Probes). 

Therapeutic genes selected from the experiments above will subsequently be re- 
cloned to be expressed under the control of one or more SCLC specific promoters 
and the efficiency of the expression analysed by transfection as described above. 



25 Example 13 

Transduction experiments in vivo. 

Once potential surface molecules and their ligands have been selected, a 
30 DNA/ligand complexing method including an endosomal lysis agent and nuclear 

targeting of a gene has been developed, the specificity and efficiency of the delivery 
system will be tested in vivo by administration of the complex to SCLC tumour 
xenografts of selected cell lines from the list above propagated in nude mice. A 
complex containing a reporter gene (e.g. p-galactosidase or EGFP) with a CMV 
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promoter in an appropriate pharmaceutical formulation will be administered to the 
tumour xenografted mice by intravenous injection in the tail vein. After 24, 48 or 72 
hours, the mice will be sacrificed and the tumours and tissues from lung, liver, heart, 
brain, spleen, kidney and skeletal muscle will be excised and stained or analysed for 
5 the product of the reporter gene (e.g. p-galactosidase). 

Transduction experiments using therapeutic genes in vivo. 

To test the ability of the DNA/ligand to deliver a therapeutic gene in vivo, transduc- 
10 tion experiments using therapeutic genes selected by in vitro experiments and 

DNA/ligand complexes selected from in vivo experiments will be performed as de- 
scribed above. If the therapeutic gene encodes a thymidine kinase, it will be accom- 
panied by administration of a nucleotide analogue (e.g. gangcyclovir). Turnout de- 
velopment will be monitored by size determination, flow cytometry of cells from bi- 
15 opsies and after sacrifice of the mice, the tumours will be analysed for apoptosis and 
necrosis. 
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Claims 

1 . A method for identifying a plurality of cell surface molecules, which are 

expressed at a different level in malignant cells compared with normal cells, 
5 comprising the steps of: 

i) providing at least 3 malignant cell lines selected from the group 
consisting of GPH 54 A, CPH 54 B, GLC 2, GLC 3, GLC 14, GLC 16, 
GLC 19, GLC 26, GLC 28, DMS 53, DMS 79, DMS 92, DMS 114, DMS 
153, DMS 273, DMS 406, DMS 456, NCI H69, NCI N417, MAR H24, 

10 MAR 86 Ml, SHP-77, NCI-H2171, NCI-H2195, NCI-H2196, NCI-H2198, 

NCI-H2227, NCI-H2286, NCI-H2330, NCI-H735, NCI-H1339, NCI- 
H1963, NCI-H2107, NCI-H2108, NCI-H1304, NCI-H1341, NCI-H1417, 
NCI-H1436, NCI-H1522, NCI-H1618, NCI-H1672, NCI-H1694, NCI- 
H1836, NCI-H1870, NCI-H1876, NCI-H1882, NCI-H1926, NCI-H1930, 

1 5 NCI-H1 994, NCI-H2029, NCI-H2059, NCI-H2066, NCI-H2081 , NCI- 

H2141, NCI-H211, NCI-H220, NCI-H250, NCI-H524, NCI-H592, NCI- 
H711, NCI-H719, NCI-H740, NCI-H748, NCI-H774, NCI-H841, NCI- 
H847, NCI-H865, NCI-H1048, NCI-H1059, NCI-H1092, NCI-H1105, 
NCI-H1184, NCI-H1238, NCI-H1284, NCI-H1688, NCI-H187, NCI-H378, 

20 NCI-H526, NCI-H660, NCI-H889, NCI-H60, NCI-H196, NCI-H446, NCI- 

H209, NCI-H146, NCI-H82, NCI-H460, NCI-H345, NCI-H510A, NCI-128, 
NCI-446,SW 1271; and 

ii) providing at least 3 total RNA samples derived from normal tissue 
selected from the group consisting of liver, heart, kidney, lung, adrenal 

25 gland, colon, pancreas, small intestine, spleen, skeletal muscle, trachea, 

prostate, placenta, salivary gland, testes, leucocytes, brain, adipose 
tissue, bladder, breast, cervix, esophagus, larynx, ovary, rectum, skin, 
spinal cord, stomach, thymus, thyroid and uterus; and 

iii) comparing the expression of mRNA in the cell lines according to i) and 
30 tissue samples according to ii); and 

iv) identifying nucleic acid sequences, wherein 

a) there is a difference between the amount of 
mRNA expressed in one or more cell lines 
according to i) and the amount of mRNA 
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expressed in one or more tissues according to ii); 
and/or 



b) 



there is essentially no difference in the amount of 
mRNA expressed in at least two cell lines 



according to i); and/or 



c) 



there is essentially no difference in the amount of 
mRNA expressed in at least two tissue samples 
according to ii); and 



10 

2. 

15 3. 
4. 

20 

5. 

25 



v) selecting among the nucleic acid sequences according to iv), nucleic 
acid sequences encoding for potential cell surface molecules. 

The method according to claim 1 f wherein step i) involves providing at least 5 
malignant cell lines. 

The method according to claim 1, wherein step ii) involves tissue samples 
derived from lung, liver, heart, and kidney. 

The method according to claim 1 , wherein step ii) involves providing at least 5 
total RNA samples. 

The method according to claim 1, wherein step iii) comprises the steps of: 

i) isolating RNA comprising mRNA from the malignant cell lines; and 

ii) preparing cDNA populations from said RNA, wherein one cDNA 



population is prepared from RNA isolated from one cell line or one tissue 
sample; and 

iii) labelling each cDNA population with a detectable label; and 

iv) providing solid supports on which an array of known nucleic acid 
sequences has been immobilised; and 

v) incubating each cDNA population with a solid support under conditions 
which allows for hybridisation; and 

vi) detecting said detectable label on the solid" supports 



6. 

35 



The method according to claim 5, wherein said detectable label is a 
fluorescent label. 
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7. The method according to claim 5, wherein the solid support is a glass plate. 

8. The method according to claim 5, wherein at least 1000 known nucleic acid 
5 sequences are immobilised on the solid support. 

9. The method according to claim 1 , wherein the difference in step iv), a) is at 
least 2-fold in mRNA expression. 

10 10. The method according to claim 1 , wherein the difference in step iv), a) is an in 
principle unlimited number of fold. 

1 1 . The method according to claim 1 , wherein step iv), b) involves at least 3 
malignant cell lines. 

15 

12. The method according to claim 1 , wherein step iv), c) involves at least 3 RNA 
samples. 

13. The method according to claim 1 , wherein nucleic acid sequences encoding 
20 for potential cell surface molecules according to step v) are selected according 

to information available in commonly accessible databases selected from the 
group consisting of PubMed (NCBI), Nucleotide (NCBI), Protein (NCBI), 
Structure (NCBI), OMIM (NCBI) and LocusLink (NCBI). 



25 14. The method according to claim 1 , wherein nucleic acid sequences encoding 

for potential cell surface molecules according to step v) are selected according 
to sequence homology with known cell surface molecules, there is at least 
20% sequence identify. 

30 15. The method according to claim 1 , wherein nucleic acid sequences encoding 

for potential cell surface molecules according to step v) are selected according 
to sequence homology with domains comprised within known cell surface 
molecules. 
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16. The method according to claim 1, wherein nucleic acid sequences encoding 
potential cell surface molecules according to step v) are selected such as the 
potential cell surface molecules comprise a domain selected from the group 
consisting of hydrophobic regions and potential glycosylation sites. 

17. A method of identifying first nucleic acid sequences, which are capable of 
directing expression of second nucleic acid sequences operably linked 
thereto, wherein the level of said expression is different in malignant cells 
compared with normal cells comprising the steps of: 



i) providing at least 3 malignant cell lines selected from the group 
consisting of CPH 54 A, CPH 54 B, GLC 2, GLC 3, GLC 14, GLC 16, 
GLC 19, GLC 26, GLC 28, DMS 53, DMS 79, DMS 92, DMS 1 14, DMS 
153, DMS 273, DMS 406, DMS 456, NCI H69, NCI N417, MAR H24, 

15 MAR 86 Ml, SHP-77, NCI-H2171, NCI-H2195, NCI-H2196, NCI-H2198, 

NCI-H2227, NCI-H2286, NCI-H2330, NCI-H735, NCI-H1339, NCI- 
H1963, NCI-H2107, NCI-H2108, NCI-H1304, NCI-H1341, NCI-H1417, 
NCI-H1436, NCI-H1522, NCI-H1618, NCMH1672, NCI-H1694, NCI- 
H1836, NCI-H1870, NCI-H1876, NCI-H1882, NCI-H1926, NCI-H1930, 

20 NCI-H1994, NCI-H2029, NCI-H2059, NCI-H2066, NCI-H2081, NCI- 

H2141, NCI-H211, NCI-H220, NCIrH250, NCI-H524, NCI-H592, NCI- 
H711. NCI-H719, NCI-H740, NCI-H748, NCI-H774, NCI-H841, NCI- 
H847, NCI-H865, NCI-H1048, NCMH1059, NCI-H1092, NCI-H1105, 
NCI-H1184, NCI-H1238, NCI-H1284, NCI-H1688, NCI-H187, NCI-H378, 

25 NCI-H526, NCI-H660, NCI-H889, NCI-H60, NCMH196, NCI-H446, NCI- 

H209, NCI-H146, NCI-H82, NCI-H460, NCI-H345, NCI-H510A, NCI-128, 
NCI-446 and SW 1271, and 

ii) providing at least 3 RNA samples derived from normal tissue samples 
derived from the group consisting of liver, heart, kidney, lung, adrenal 

30 gland, colon, pancreas, small intestine, spleen, skeletal muscle, trachea, 

prostate, placenta, salivary gland, testes, leucocytes, brain, adipose 
tissue, bladder, breast, cervix, esophagus, larynx, ovary, rectum, skin, 
spinal cord, stomach, thymus, thyroid and uterus; and 

iii) comparing the expression of mRNA in the cell lines according to i) and 
35 tissue samples according to ii); and 
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iv) identifying second nucleic acid sequences, wherein 

a) there is a difference between the amount of mRNA expressed in one or 
more cell lines according to i) and the amount of mRNA expressed in 
one or more tissues according to ii); and/or 

b) there is essentially no difference in the amount of mRNA expressed in 
at least two cell lines according to i); and/or 

c) there is essentially no difference in the amount of mRNA expressed in 
at least two tissue samples according to ii); and 

v) identifying first nucleic acid sequences operably linked to the second 
nucleotide sequences identified in step iv) 

The method according to claim 17, wherein step i) involves providing at least 5 
malignant cell lines. 

The method according to claim 17, wherein step ii) involves tissue samples 
derived from the group consisting of lung, liver, heart and kidney. 

The method according to claim 17, wherein step ii) involves providing at least 
5 RNA samples. 

The method according to claim 17, wherein step iii) comprises the steps of: 

i) isolating RNA from the malignant cell lines; and 

ii) preparing cDNA populations from said RNA, wherein one cDNA 
population is prepared from RNA isolated from one cell line or one tissue 
sample; and 

iii) labelling each cDNA population with a detectable label; and 

iv) providing solid supports on which an array of known nucleic acid 
sequences has been immobilised; and 

v) incubating each cDNA population with a solid support under conditions 
which allows for hybridisation; and 

vi) detecting said detectable label on the solid supports 

The method according to claim 21 , wherein said detectable label is a 
fluorescent label. 
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23. The method according to claim 21 , wherein the solid support is a glass plate. 

24. The method according to claim 21 , wherein at least 1000 different known 
5 nucleic acid sequences are immobilised on the solid support. 

25. The method according to claim 17, wherein the difference in step iv) t a) is at 
least 2-fold difference in mRNA expression. 

10 26. The method according to claim 1 7, wherein the difference in step iv), a) is an 
in principle unlimited number of fold. 

27. The method according to claim 17, wherein step iv), b) involves at least 3 
malignant cell lines. 

15 

28. The method according to claim 17, wherein step iv), c) involves at least 3 
tissue samples. 

29. The method according to claim 17, wherein any first nucleic acid sequence 
20 operably linked to a second nucleic acid sequence comprise at least 15,000 

base pairs upstream of the translation start codon of said second nucleic acid 
sequence on the chromosome. 

30. The method according to claim 17, wherein any first nucleic acid sequence 
25 operably linked to a second nucleic acid sequence comprise at least 5,000 

base pairs upstream of the translation start codon of said second nucleic acid 
sequence on the chromosome. 

31 . The method according to claim 17, wherein any first nucleic acid sequence 

30 operably linked to a second nucleic acid sequence comprises up to up to 5000 

base pairs upstream of the translation start codon of said second nucleic acid 
sequence on the chromosome. 



32. 

35 



The method according to claim 17, wherein any first nucleic acid sequence 
operably linked to a second nucleic acid sequence comprise intron sequences 
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found upstream of the translation start codon of said second nucleic acid 
sequence on the chromosome, 

33. The method according to claim claim 17, wherein any first nucleic acid 

5 sequence operably linked to a second nucleic acid sequence comprise intron 

sequences found downstream of the translation start codon of said second 
nucleic acid sequence oh the chromosome. 

34. The method according to claim 17, wherein any first nucleic acid sequence 
10 operably linked to a second nucleic acid sequence comprise an enhancer 

sequence located more than 10,000 base pairs upstream or downstream from 
the translation start codon of said second nucleic acid sequence on the 
chromosome. 

1 5 35. The method according to claim 1 7, wherein any first nucleic acid sequence 
operably linked to a second nucleic acid sequence comprise up to 5000 base 
pairs upstream from the translation start codon of said second nucleic acid 
sequence on the chromosome, from which at least 10 internal bp have been 
deleted. 

20 

36. The method according to claim 17, wherein any first nucleic acid sequence 
operably linked to a second nucleic acid sequence comprise up to 5000 base 
pairs upstream from the translation start codon of said second nucleic acid 
sequence on the chromosome, from which at least one silencer has been 

25 deleted. 

37. A targeting complex comprising: 

i) a binding partner capable of binding a cell surface molecule identified by 
the method according to any of the claims 1 to 16, wherein said cell 

30 surface molecule is selected from the group consisting of GRIA2,GRM8, 

ITGAV, ITGAE, NCAM1 , NPTXR, LRP8 and CHRNA5 ; and 

ii) a bioreactive species 



38. 

35 



The targeting complex according to claim 37, wherein the cell surface 
molecule comprises or essentially consists of GRIA2. 
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39. The targeting complex according to claim 37, wherein the cell surface 
molecule comprises or essentially consists of GRM8. 

5 40. The targeting complex according to claim 37, wherein the cell surface 
molecule comprises or essentially consists of ITGAV. 



41 . The targeting complex according to claim 37, wherein the cell surface 
molecule comprises or essentially consists of ITGAE. 

10 

42. The targeting complex according to claim 37, wherein the binding partner is 
selected from the group consisting ofL-glutamate, kainate, 5-(bromomethyl)-4- 
isoxazolepropionic acid, analogues of glutamate, substituted quinoxaline 2,3 
diones, GYKI52466, 5-l-Willardine, 5-F-Willardine, agonist and antagonist lig- 

1 5 ands to the AMPA ((RS)-a-Amino-3-hydroxy-5-methyl-4-isoxazolepropionic 

acid, NBQX, CNQX, DNQX, GYKI 52466, 6-Chlorokynurenic acid, JSTX, L- 
APA, L-SOP, ACPT, (R.S)-PPG, CPPG, MAP4, (S)-3,4-DCPG, vitronectin, 
cytactin, fibronectin, fibrinogen, laminin, MMP-2, osteopontin, prothrombin, 
thrombospondin, von Willebrandts Factor, recombinant fragments of L1CAM, 

20 salmosin, E-cadherin and peptides thereof, including the peptide: NRDKETKV, 

NCAM1 domain Ig Ml, NCAM1 domain Iglll and peptides thereof, peptides 
C3: ASKKPKRNIKA, D3: AKKERQRKDTU, D4: ARALNWGAKP, monoclonal 
antibody 123C3, NPTX1, NPTX2, taipoxin,TCBP49, Oxynor, ApoE2, ApoE3, 
ApoE4, peptides from ApoE (E 14 i : -155; LRKLRKRLLRDADDL and its tandem 

25 E ( i4i;-155) 2 ; LRKLRKRLLRDADDL-LRKLRKRLL RDADDL) reelin, nicotine, 

acetylcholine, a-bungarotoxin, carbachol and specific antibodies against any 
of said surface molecules. 



The targeting complex according to claim 37, wherein said cell surface 
molecule is capable of internalising the targeting complex. 

The targeting complex according to claim 37, wherein said bioreactive species 
comprises a nucleic acid. 

i 
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45. The targeting complex according to claim 44, wherein the nucleic acid 
comprises a second nucleic acid operably linked to a first nucleic acid 
sequence comprising an expression signal. 

5 46. A targeting complex comprising 

i) a binding partner capable of binding a cell surface molecule identified by 
the method according to any of the claims 1 to 16, wherein said cell 
surface molecule is capable of internalising the targeting complex; and 
10 ii) a bioreactive species comprising a nucleic acid sequence comprising a 

second nucleic acid operably linked to a first nucleic acid sequence 
comprising an expression signal, wherein said first nucleic acid 
sequence has been identified by the method according to any of claims 
17 to 36. 

15 

47. The targeting complex according to claim 46, wherein the cell surface mole- 
cule comprises or essentially consists of a cell surface molecule selected from 
the group consisting of NCAM1 , NPTXR, LRP8 and CHRNA5. 

48. The targeting complex according to claim 46, wherein the cell surface mole- 
20 cule comprises or essentially consists of NCAM1 . 

49. The targeting complex according to claim 46, wherein the cell surface mole- 
cule comprises or essentially consists of NPTXR. 

25 50. The targeting complex according to claim 46, wherein the cell surface mole- 
cule comprises or essentially consists of LRP8. 

51 . The targeting complex according to claim 46. wherein the cell surface mole- 
cule comprises or essentially consists of CHRNA5. 

30 

52. The targeting complex according to claim 46, wherein the binding partner is 
selected from the group consisting of NCAM1 domain Ig Ml, NCAM1 domain 
. Iglll and peptides thereof, peptides C3: ASKKPKRNIKA, D3: AK- 
KERQRKDTU, D4: ARALNWGAKP, monoclonal antibody 123C3, NPTX1, 

35 NPTX2, taipoxin,TCBP49, Oxynor, ApoE2, ApoE3, ApoE4, peptides from 
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ApoE (Ei4i;-i5s; LRKLRKRLLRDADDL and its tandem E (14 i;-i 5 5)2; 
LRKLRKRLLRDADDL-LRKLRKRLL RDADDL) reelin, nicotine, acetylcholine, 
oc-bungarotoxin, carbachol and specific antibodies to said surface molecules. 



53. The targeting complex according to claim 46, wherein said first nucleic acid 
sequence comprises an expression signal which direct a higher level of 
expression of said second nucleic acid sequence in malignant cells compared 
with non-malignant cells. 

10 

54. The targeting complex according to claim 46, wherein said first nucleic acid 
sequence comprise an expression signal which directs a lower level of 
expression of said second nucleic acid sequence in malignant cells compared 
with non-malignant cells. 

15 

55. The targeting complex according to any of claims 46 to 47, wherein said first 
nucleic acid sequence is selected from the group consisting of pro221, 
pro210, pro71,pro41,pro30, pro2, pro209, pro14, pro4, pro8, pro246, pro16, 
pro27, pro5, pro49, pro19, pro140, pro139, pro207, pro81, pro273 and pro362. 

20 

56. The targeting complex according to any of claims 46 to 47, wherein said first 
nucleic acid sequence comprises fragments of nucleotide sequences selected 
from the group consisting of pro221, pro210, pro71,pro41 I pro30, pro2, pro209, 
pro14, pro4, pro8, pro246, pro16, pro27, pro5, pro49, pro19, pro140, pro139, 

25 pro207, pro81 , pro273 and pro362. 



57. The targeting complex according to any of claims 46 to 47, wherein said first 
nucleic acid sequence comprises more than one fragment of nucleotide se- 
quences selected from the group consisting of pro221, pro210, pro71, pro41, 

30 pro30, pro2, pro209, pro14, pro4, pro8, pro246, pro16, pro27, pro5, pro49, 

pro19, pro140, pro139, pro207, pro81, pro273 and pro362.. 

58. The targeting complex according to any of claims 46 to 47, wherein said first 
nucleic acid sequence comprises pro221 or one or more fragments thereof. 



35 
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59. The targeting complex according to any of claims 46 to 47, wherein said first 
nucleic acid sequence comprises pro21 0 or one or more fragments thereof 

60. The targeting complex according to any of claims 46 to 47, wherein said first 
nucleic acid sequence comprises prb71 or one or more fragments thereof 

61 . The targeting complex according to claim 46, wherein said first nucleic acid 
sequence further comprises nucleic acid sequences not natively associated 
therewith. 

62. The targeting complex according to claim 61 , wherein said nucleic acid 
sequences not natively associated therewith is a steroid hormone receptor 
binding site. 



1 5 63. The targeting complex according to any of claims 44 and 46, wherein said 
second nucleic acid sequence encodes a therapeutic protein. 

64. The targeting complex according to any of claims 44 and 46, wherein said 
second nucleic acid sequence encodes or comprises an antisense RNA or 

20 part of an antisense RNA. 

65. The targeting complex according to any of claims 44 and 46, wherein said 
second nucleic acid sequence encodes or comprises a ribozyme. 

25 66. The targeting complex according to any of claims 44 and 46, wherein said 
antisense RNA or said ribozyme is targeted against RNA of an oncogene or 
proto-oncogene. 

67. The targeting complex according to claim 66, wherein said oncogene or proto- 
30 oncogene is selected from the group consisting of Ras, Raf, Myc, Syn, Pirn, 

BMM, FOP, Sis, KGF, Fms, Fig, Neu, Trk, Kit, Met, Src, Fyn, Mas, Fes/Fps, 
Tre, Mer, ABL, BCL3, int-2, Cym, Ets, Elk, RhoA, Ski, Wnt-5a, Spi-1, Rap2, 
p55and c-tyr. 
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68. The targeting complex according to any of claims 37 and 46, wherein the 
complex further comprises a protective capping, wherein said protective 
capping consists of nucleic acid sequences attached to the first and/or second 
nucleic acid sequences. 

5 

69. The targeting complex according to claim 68, wherein the protective capping 
comprises a modified nucleotide. 

70. The targeting complex according to claim 37, wherein said bioreactive species 
10 is a toxin. 

71 . The targeting complex according to claim 70, wherein said toxin is selected 
from the group consisting of ricin, diphteria toxin, pseudomonas exotoxin, 
streptozotocin or cholera toxin. 

15 

72. The targeting complex according to claim 37, wherein said bioreactive species 
is an inducer of apoptosis. 

73. The targeting complex according to claim 72, wherein said inducer of 
20 apoptosis is selected from the group consisting of retinoic acid, A23187, 

Okadaic Acid, Puromycin, Staurosporine, Thapsigargin, Actinomycin D, 
Camptothecin, Cycloheximide, Dexamethasone, Etoposide and 
Glucocorticoid. 



25 74. The targeting complex according to claim 37, wherein said bioreactive species 
comprises a radioisotope. 

75. The targeting complex according to claim 37,' wherein said bioreactive species 
comprises a cytostatica. 

30 

76. The targeting complex according to claim 37, wherein said bioreactive species 
comprises or essentially consists of a polypeptide. 



77. 

35 



The targeting complex according to claim 76, wherein said polypeptide is a 
therapeutic protein. 
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78. The targeting complex according to any of claims 63 and 77, wherein said 
therapeutic protein is a tumour suppressor. 

5 79. The targeting complex according to any of claims 63 and 77, wherein said 
therapeutic protein is an inducer of apoptosis. 

80. The targeting complex according to claim 63 and 77, wherein said therapeutic 
protein is a protein, which can contribute to a cell cycle arrest. 

10 

81 . The targeting complex according to any of claims 63 and 77, wherein said 
therapeutic protein is a protein capable of protecting the cell against a toxic 
agent. 

1 5 82. The targeting complex according to any of claims 63 and 77, wherein said 
therapeutic protein is protein capable of catalysing the synthesis of a toxic 
substance. 

83. The targeting complex according to any of claims 63 and 77, wherein said 
20 therapeutic protein is p53. 

84. The targeting complex according to any of claims 63 and 77, wherein said 
therapeutic protein is a tumor suppressor selected from the group consisting 
of p73, p16, Rb, APC, DCC, NF-1. NF-2, WT-1, MEN-1, MEN-II, BRCA1, VHL, 

25 FCC, MCC, MSH2, PTCH, DPCH, TSC2, CDKN2A and ARF. 

85. The targeting complex according to any of claims 63 and 77, wherein said 
therapeutic protein is an inducer of apoptosis selected from the group 
consisting of Fas/Apo1 t TNF, TRAIL, TGF-p, caspases, Bak, Bax , Bid, Bik 

30 and GZMB. 

86. The targeting complex according to any of claims 63 and 71\ which comprises 
more than one first nucleotide sequence encoding a therapeutic protein or 
more than one therapeutic protein, for example 2, such as 3, for example 4 

35 therapeutic protein. 
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87. The targeting complex according to any of claims 37 to 86, wherein the 
complex further comprises a nuclear targeting signal. 

. 5 88. The targeting complex according to claim 87, wherein the nuclear targeting 
signal is an oligopeptide. 

89. The targeting complex according to claim 87, wherein the nuclear targeting 
signal is the nuclear localisation signal of simian virus 40 large tumour antigen. 

10 

90. The targeting complex according to any of claim 37 to 89, wherein the 
complex further comprises a endosomal lytic agent. 

91 . The targeting complex according to claim 90, wherein the endosomal lytic 
1 5 agent is selected from the group consisting of polyethylenimine (PEI), a 

replication defective virus and a viral protein capside. 

92. The targeting complex according to any of claims 37 to 89, wherein the 
targeting complex further comprises chloroquine. 

20 

93. The targeting complex according to claim 90, wherein both the endosomal lytic 
agent comprises a membrane destabilising polypeptide. 

94. The targeting complex according to any of claims 37 to 93, wherein the 
25 binding partner associates with the bioreactive species via a nucleic acid 

binding agent covalently attached to said binding partner. 

95. The targeting complex according to claim 94, wherein the nucleic acid binding 
agent is selected from the group consisting of poly-L-lysine (PLL), spermine, 

30 spermidine and histone proteins. 



96. 



The targeting complex according to claim 94, wherein the nucleic acid binding 
agent is PLL comprising from 15 to 1000 residues. 
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97. The targeting complex according to any of claims 37 to 93, wherein the 

binding partner associates with the bioreactive species indirectly via a pair of 
specific interacting components wherein one component is covalently attached 
to the bioreactive species and the second component is covalently attached to 
5 the binding partner. 



98. The targeting complex according to claim 97, wherein said interacting 
components are biotin and streptavidin. 

10 99. Use of a cell surface molecule identified according to any of claims 1 to 16 as 
a drug target, wherein said drug target is capable of binding a binding partner 
and internalising said binding partner into cells expressing said cell surface 
molecule. 

15 1 00. The use according to claim wherein the cell surface molecule is selected from 
the group consisting of NCAM1, NPTXR, LRP8, CHRNA5, GRIA2, GRM8, IT- 
GAV f ITGAE, TNFRSF12, L1 CAM, GPR49 and TMEFF1. 

101 . A method of identifying and/or preparing specific binding partners comprising 
20 the steps of 

i) Providing a cell surface molecule identified by the method according to 
any of the claims 1 to 16, wherein the cell surface molecule is selected 
from the group consisting of TNFRSF12, L1CAM, GPR49 and TMEFF1\ 

25 and 

ii) Identifying and/or preparing binding partners capable of associating with 
said cell surface molecules 

102. The method according to claim 101, wherein said binding partners can be 
30 used in pharmaceutical compositions for the treatment of a premalignant 

and/or malignant conditions. 

103. The method according to claim 101, wherein said binding partner is prepared 
by the following steps 
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i) immunising an animal with said cell surface molecule or part of said cell 
surface molecule; and 

ii) obtaining antibodies from said animal; or 

iii) obtaining cells producing antibodies from said animal and obtaining 
antibodies from said cells 

104. The method according to claim 101, wherein said binding partners are 
selected from an expression library expressing polypeptides and/or 
oligopeptides. 

105. The method according to claim 101, wherein said binding partners are 
selected from a synthetic combinatorial library expressing polypeptides and/or 
oligopeptides. 



15 106. The method according to claim 101 , wherein said binding partner is identified 
by screening a phage display library of antibodies. 

107. The method according to claim 101, wherein said binding partner is identified 
by screening a phage display library of human antibodies. 

20 

108. The method according to claim 101, wherein said binding partners are 
selected, from a library of small chemical compounds 

109. An isolated and/or purified specific binding partner capable of associating with 
25 a cell surface molecule, which is expressed at a different level in malignant 

cells compared with normal cells, identified by the method according to any of 
the claims 101 to 108. 

110. The binding partner according to claim 109, wherein said binding partner 
30 comprises or essentially consists of a polypeptide. 

111. The binding partner according to claim 109, wherein said binding partner is an 
antibody or a fragment of an antibody. 
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112. The binding partner according claim 109, wherein said binding partner is a 
polyclonal antibody or a fragment thereof. 

113. The binding partner according to claim 109, wherein said binding partner is a 
5 monoclonal antibody or a binding fragment thereof. 

1 14. The binding partner according claim 109, wherein said binding partner is a 
murine monoclonal antibody. 



10 115. The binding partner according to claim 109, wherein said binding partner is a 
humanised antibody. 



116. The binding partner according to claim 109, wherein said binding partner is a 
human antibody identified from a phage display library. 

15 

117. The binding partner according to any of claims 1 1 1 to 1 16, wherein said 
antibody can interact with the extracellular part of the cell surface molecule. 



118. The binding partner according to any of claims 111 to 1 16, wherein said 
20 antibody can interact with a posttranslational modification of the extracellular 

part of the cell surface molecule. 



1 1 9. The binding partner according to any of claims 1 1 1 to 1 1 8, wherein said 
antibody is capable of being internalised upon association with said cell 
25 surface molecule. 



120. The binding partner according to claim 109, wherein said binding partner is a 
naturally occurring ligand for said cell surface molecule. 

30 121. The binding partner according to claim 109, wherein said binding partner is a 
recombinant^ produced ligand for said cell surface molecule. 

122. The binding partner according to claim 109, wherein. said binding partner is a 
viral protein. 



35 
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123. The binding partner according to claim 109, wherein said binding, partner is a 
viral capsid. protein. 

124. The binding partner according claim 109, wherein said binding partner is a 
5 recombinantly produced and comprise viral capsid protein sequences. 

125. The binding partner according to claim 109, wherein said binding partner is a 
small chemical compound. 

10 126. The targeting complex according to any of claims 37 to 98, wherein the 

binding partner is a binding partner according to any of claims 109 to 125. 

127. A method of identifying novel drug targets, comprising the steps of 

15 i) Providing a binding partner according to any of claims 1 09 to 1 25 

ii) Identifying potential drug targets capable of associating with said binding 
partner 

128. The method according to claim 127, wherein said drug target comprise a 

20 polypeptide, which is a cell surface molecule expressed at a different level in 

malignant cells compared with normal cells. 

129. A drug target identified by the method according to any of the claim 127 to 
128. 

25 

130. A complex comprising a cell surface molecule identified according to any of 
the claims 1 to 16 and a targeting complex according to any of the claims 37 
to 98. 

30 131. The complex according to claim 130, wherein the cell surface molecule is se- 
lected from the group consisting of NCAM1 , NPTXR, LRP8, CHRNA5, GRIA2, 
GRM8, ITGAV, ITGAE, TNFRSF12, L1CAM, GPR49, TMEFF. 



132. 

35 



A use of the binding partner according to any of claims 109 to 125 for the 
preparation of a targeting complex according to any of claims 37 to 98. 
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1 33. A pharmaceutical composition comprising of the targeting complex according 
to any of the claims 37 to 98 together with a pharmaceutical^ acceptable 
carrier. 

1 34. A method of treatment of a premalignant and/or malignant conditions in an 
individual in need thereof, comprising administering to said individual a 
pharmaceutical^ effective amount of the targeting complex according to any 
of the claims 37 to 98. 

135. The method according to claim 134, wherein said treatment is ameliorating 
treatment. 



136. The method according to claim 134, wherein said treatment is curative 
15 treatment. 

137. The method according to claim 134, wherein said treatment is prophylactic 
treatment. 

20 1 38. The method according to claim 1 34, wherein said condition is a cancer 
selected from the group consisting of melanoma, brain tumour, 
neuroblastoma, breast cancer, lung cancer, prostate cancer, cervix cancer, 
uterine cancer, ovarian cancer, leukaemia, colon cancer, rectum cancer and 
bladder cancer. 

25 

139. The. method according to claim 134, wherein said condition is lung cancer 
selected from the group comprising small cell lung cancer (SCLC) and non- 
small cell lung cancer (NSCLC). 

30 140. The method according to claim 134, wherein said condition is small cell lung 
cancer. 

141. The method according to claim 134, wherein said condition is breast cancer. 
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142. The method according to claim 134, wherein said clinical condition is a brain 
tumour selected from the group comprising glioblastomas, neuroblastomas, 
astrocytomas, oligodendrogliomas, meningiomas, medulloblastomas, 
neuronomas, ependymomas, craniopharingiomas, pineal tumours, germ cell 

5 tumours and schwannomas. 

143. The method according to claim 134, wherein said targeting complex is 
administrated parenterally. 

10 144. The method according to claim 134, wherein said targeting complex is 
administrated by intravenous injection. 

145. The method according to claim 134, wherein said targeting complex is 
administrated by subcutaneous injection. 

15 

146. The method according to claim 134, wherein said method further comprises 
one or more second treatments. 

147. The method according to claim 146, wherein said second treatments are 
20 selected from the group consisting of surgical treatment, chemotherapy, 

radiation therapy, therapy with cytokines, Hormone therapy, gene therapy, 
immunotherapy and treatments using laser light. 

148. A use of the targeting complex according to any of the claims 37 to 98 for the 
25 preparation of a medicament for the treatment of a premalignant and/or 

malignant conditions in an individual in need thereof. 

149. The use according to claim 148, wherein said treatment is ameliorating 
treatment. 



30 



150. The use according to claim 148, wherein said treatment is curative treatment. 



151. 



The use according to claim 148, wherein said treatment is prophylactic 
treatment. 
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152. The use according to claim 148, wherein said condition is a cancer selected 
from the group consisting of melanoma, brain tumour, neuroblastoma, breast 
cancer, lung cancer, prostate cancer, cervix cancer, uterine cancer, ovarian 
cancer, leukaemia, colon cancer, rectum cancer and bladder cancer. 

5 

153. The use according to claim 148, wherein said condition is lung cancer 
selected from the group comprising small cell lung cancer (SCLC) and non- 
small cell lung cancer (NSCLC). 

10 154. The use according to claim 148, wherein said condition is small cell lung 
cancer. 

155. The use according to claim 148, wherein said condition is breast cancer. 

15 156. The use according to claim 148, wherein said condition is a brain tumour 
selected from the group comprising glioblastomas, neuroblastomas, 
astrocytomas, oligodendrogliomas, meningiomas, medulloblastomas, 
neuronomas, ependymomas, craniopharingiomas, pineal tumours, germ cell 
tumours and schwannomas. 

20 

157. The use according to claim 148, wherein said medicament is suitable for 
parenteral administration. 

158. The use according to claim 148, wherein said medicament is suitable for 
25 intravenous injection. 

159. The use according to claim 148, wherein said medicament is suitable for 
subcutaneous injection. 

30 160. A use of a pharmaceutical^ effective amount of cell surface molecule 

identified according to any of claims 1 to 16 for the preparation of a vaccine. 



35 



161. 



A use of a pharmaceuticaily effective amount of a nucleic acid sequence 
encoding a cell surface molecule identified according to any of claims 1 to 16 
for the preparation of a vaccine. 
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162. The use according to any of daims 160 and 161 , wherein the cell surface 
molecule is selected from the group consisting of NCAM1 , NPTXR,.LRP8, 
CHRNA5, GRIA2, GRM8, ITGAV, ITGAE, TNFRSF12, L1CAM, GPR49 and 

5 TMEFF1. 

163. The use according to any of 160 and 161 , wherein said vaccine furthermore 
comprise a non-self antigen covalently linked to said cell surface molecule. 

10 164. The use according to any of 160 and 161, wherein said vaccine furthermore 
comprises second nucleic acid sequences encoding a non-self antigen linked 
to the nucleic acid sequences. 

165. The use according to any of 160 and 161 , wherein said vaccine furthermore 
1 5 comprise more than one antigen. 

166. The use according to any of claims 160 and 161, wherein said vaccine 
furthermore comprise an adjuvant. 

20 167. The use according to any of claims 160 and 161 , wherein said vaccine is 
suitable for ameliorating and/or curative and/or prophylactic treatment of a 
premalignant and/or malignant conditions. 
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